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ABSTRACT 


This  study  was  designed  to  investigate  the  extent  of 
socio-economic  bias  in  selected  intelligence  tests,  with  a 
view  to  identifying  instruments  which  measure  general  intel¬ 
ligence  with  a  minimum  of  such  bias.  Intelligence  was  re¬ 
garded  statistically  as  a  hierarchical  construct,  charac¬ 
terized  chiefly  by  a  broad  general  factor,  and  narrower 
ability  and  environmental  factors.  Factor  analysis  load¬ 
ings  on  g,  the  general  intellectual  component  common  to  a 
variety  of  mental  tests ,  were  taken  as  a  major  criterion  of 
the  construct  validity  of  each  intelligence  test,  while  the 
extent  of  cultural  loadings  was  determined  by  such  criteria 
as  relationship  with  socio-economic  status  at  different  age 
levels,  dependence  on  acquired  knowledge,  and  patterns  of 
item  discrimination  across  socio-economic  levels. 

Nine  promising  culture-reduced  intelligence  tests  and 
sub-tests  were  administered  to  a  representative  Edmonton 
sample  of  271  Grade  VII  children,  and  the  results  analyzed 
in  relation  to  measures  of  socio-economic  status,  verbal  in¬ 
telligence,  achievement,  and  school  marks.  City-wide  testing 
surveys  conducted  during  the  previous  four  yeqrs  provided 
data  on  the  same  children,  making  longitudinal  comparisons 
possible  o 

The  testing  of  four  hypotheses  yielded  the  following 
results:  1.  Principal  Components  factor  analyses  of  twenty- 

seven  variables  identified  three  culture-reduced  tests  with 
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high  g  loadings  accompanied  by  neglibible  group  factor  loadings. 

2.  Eight  of  the  culture-reduced  intelligence  tests 
differentiated  between  socio-economic  levels  significantly  less 
than  did  verbal  intelligence  and  standardized  achievement  tests. 

3  The  correlation  between  socio-economic  status  and  the 
culture-reduced  Progressive  Matrices  showed  no  change  between 
Grades  III  and  VII,  while  the  corresponding  correlation  for  the 
California  Test  of  Mental  Maturity  increased  significantly. 

4.  Items  from  the  Progressive  Matrices  showed  signifi¬ 
cantly  less  variation  in  their  ability  to  discriminate  between 
socio-economic  levels  than  items  from  a  verbal  intelligence 
sub-test. 

On  the  basis  of  eight  standards  of  evaluation  used  in  the 
investigation,  Raven’s  Standard  Progressive  Matrices  was  iden¬ 
tified  as  the  most  useful  culture-reduced  measure  of  intelli¬ 
gence  in  the  battery,  as  it  was  the  only  instrument  to  satisfy 
all  criteria.  The  Cattell  test  of  g  and  the  Lorge -Thorndike 
Figure  Analogies  sub-test  were  ranked  next,  since  they  met  most 
of  the  criteria  in  less  than  half  the  testing  time  of  the 
Matrices . 

Practical  implications  were  seen  for  several  educational 
and  psychological  situations,  and  suggestions  were  made  for 
future  test  construction.  Theoretically,  support  was  found  for 
the  hypothesis  of  socio-economic  bias  in  mental  test  performance, 
and  for  the  usefulness  of  g  as  a  construct  relatively  inde¬ 
pendent  of  environmental  differences  in  school  testing  situa¬ 


tions  . 
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CHAPTER  I 


THE  GENERAL  PROBLEM 
Status  of  Intelligence  Tests 

The  general  mental  test  stands  today  as  the  most  impor¬ 
tant  single  contribution  of  psychology  to  the  practical 
guidance  of  human  affairs.  (Cronbach,  I960,  p.  157) 

Evidence  to  support  this  estimate  of  the  contem¬ 
porary  status  of  tests  of  general  intelligence  is  found 
in  the  widespread  use  to  which  they  have  been  put  in 
education,  industry,  the  armed  services,  and  research  in 
recent  years.  Tests  of  general  mental  ability  are  cur¬ 
rently  used  for  purposes  of  selection,  prediction,  group¬ 
ing,  guidance  and  research  on  the  assumption  that  they 
are  identifying  or  estimating  what  are  commonly  conceived 
of  as  underlying  potential  capacities. 

Refinement  of  intellectual  measures,  however,  has 
usually  been  in  the  direction  of  improving  concurrent  or 
predictive  validity  in  relation  to  academic  criteria, 
while  the  theoretical  foundations  on  which  the  tests  were 
originally  constructed  have  frequently  been  ignored  in  an 
attempt  to  meet  popular  demands  for  better  prediction. 

The  outcome  of  this  trend  has  been  the  construction  of 
more  instruments,  which,  for  the  average  testee,  will 
estimate  his  chances  of  future  success  or  failure  with 
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a  high  degree  of  accuracy,  assuming: 

a.  that  the  subjects  tested  fall  within  a 
specific  cultural  or  educational  range; 

b.  that  no  systematic  change  in  treatment  occurs 
after  the  occasion  of  testing. 

The  limitations  of  these  assumptions  can  be 
clearly  seen  in  the  case  of  examinees  who  are  handicapped 
by  virtue  of  unfamiliarity  with  the  language  or  materials 
used  to  obtain  samples  of  intelligent  behaviour.  Obvi¬ 
ously,  a  European  immigrant,  recently  arrived  in  Canada, 
could  scarcely  be  expected  to  think  as  rapidly  in  a 
problem  of  dollars  and  cents  as  a  Canadian  citizen  of 
similar  intellectual  ability.  Similarly,  a  child  from 
city  slums  or  country  backwoods  could  scarcely  give  his 
best  in  tests  which  reflect  an  urban  middle-class  cul¬ 
ture.  It  can  readily  be  recognized  that  items  about  orch¬ 
estras,  railways,  refrigerators,  restaurants,  street 
curbs,  distances  between  cities,  or  comprehension  problems 
based  on  middle- class  values  favour  some  sub-groups  of 
the  population  more  than  others.  Yet,  such  items  are 
frequently  incorporated  in  current  intelligence”  tests. 

If  a  change  of  treatment  between  test  and  criterion 
increases  the  knowledge  and  vocabulary  of  a  handicapped 
testee,  presumably  his  measured  intelligence  would  change 
accordingly.  This  raises  the  question  as  to  whether  it 
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may  not  be  possible  to  identify  a  measure  of  more  funda¬ 
mental  ability,  which  is  less  susceptible  to  the  vagaries 
of  vocabulary  improvement  and  familiarity  with  urban 
middle-class  life.  The  investigation  of  this  problem 
reduces  the  emphasis  on  psychometric  prediction  alone 
and  requires  a  more  thorough  investigation  of  the  theo¬ 
retical  problem  of  what  the  instruments  used  are  measuring. 
It  is  here  that  psychometrics  is  linked  most  closely  with 
psychology,  sociology,  and  neurology.  It  seems  almost 
axiomatic  to  state  that  only  through  the  mutual  interaction 
of  these  fields  of  knowledge  can  we  hope  to  improve  our 
understanding  and  manipulation  of  the  factors  involved  in 
mental  test  performance. 

Thus,  our  present  conventional  mental  tests  enjoy 
widespread  popularity  and  usage,  but  the  widening  breach 
between  psychometrics  and  psychological  theory  has 
threatened  to  leave  the  concept  of  t,intelligenceTt  in  a 
precarious  position.  For,  while  tests  which  profess  to 
estimate  intelligent  behaviour  move  further  and  further  over 
into  the  realms  of  achievement  and  past  experience,  the 
problem  of  what  they  measure  becomes  increasingly  more 
complicated  by  their  added  emphasis  on  acquired  knowledge. 

To  identify  an  instrument  which  reduces  the  impor¬ 
tance  of  these  experiential  factors  while  estimating  an 
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individual’s  intellectual  level  requires  a  detailed 
consideration  of  the  concept  of  "intelligence"  and  a 
knowledge  of  the  relationships  between  environmental 
factors  and  test  performance. 

Intelligence  and  Cultural  Differences.  Of  the 
more  important  categories  of  environmental  variables 
which  appear  to  influence  test  performance  on  conven¬ 
tional  intelligence  tests,  those  associated  with  socio¬ 
economic  status  must  receive  early  priority.  Numerous 
research  studies  have  shown  a  marked  positive  relation¬ 
ship  between  socio-economic  status  and  intelligence 
test  score.  Most  investigators  agree  that  such  a  rela¬ 
tionship  exists,  but  the  reasons  for  its  existence  have 
provoked  considerable  argument  amongst  psychologists. 

There  are  those  who  contend  that  hereditary 
differences  contribute  the  major  portion  of  the  variance 
to  intelligence  quotients  since  the  more  gifted  adults 
gradually  rise  to  the  top  of  the  social  and  economic 
ladders,  while  passing  on  to  their  children  their  sup¬ 
erior  intellectual  endowment. 

On  the  other  hand,  many  psychologists  who  empha¬ 
size  the  variation  in  environmental  stimulation  for 
children  of  the  various  socio-economic  levels  maintain 
that  differences  in  test  scores  are  due  primarily  to 
differences  in  opportunity  to  develop  intelligence. 
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More  recently,  a  third  possible  influence  which 
largely  sidesteps  the  nature-nurture  controversy,  has 
received  considerable  attention.  This  is  the  contention 
that  differences  in  intelligence  test  score  are  fre¬ 
quently  due  to  cultural  bias  which  is  built  into  the 
test  items.  Advocates  of  this  position  have  demon¬ 
strated  that  certain  intelligence  tests  show  the  rela¬ 
tionship  with  social  status  to  a  greater  degree  than 
others,  while  single  items  differentiate  in  such  an 
irregular  manner  between  upper  and  lower  social  classes 
that  an  interpretation  in  terms  of  cultural  advantage 
appears  plausible. 

Other  investigators  have  added  the  variables  of 
test-motivation  and  work  habits  to  the  contributing 
causes  of  social  class  differences.  If  these,  too,  are 
considered  important,  the  difficulty  of  constructing  a 
test  which  would  provide  a  fair  assessment  of  the  intel¬ 
lectual  potential  of  individuals  in  all  groups  of  society 
immediately  becomes  apparent. 

Whatever  future  research  reveals  about  the  relative 
importance  of  heredity  and  environment  in  intelligence 
test  results,  it  is  surely  possible  to  improve  our  present 
instruments  by  concentrating  our  efforts  on  the  third 
contributing  factor,  cultural  advantage.  If  an  intelli¬ 
gence  test  can  be  devised  which  taps  a  basic  aspect  of 
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intellectual  growth,  while  reducing  the  cultural  bias 
involved,  it  should  make  possible  the  improved  identi¬ 
fication  of  gifted  individuals  from  handicapped  or 
understiinulated  environments. 

In  recent  years,  the  urgency  of  this  problem  of 
talent  identification  has  been  strongly  emphasized  by 
the  publications  of  such  authorities  as  the  American 
Association  for  the  Advancement  of  Science,  (195$),  the 
American  Social  Science  Research  Council  (McClelland 
et  al. ,  195$),  the  Rockefeller  Report  on  problems  of 
American  education  (195$),  and  by  newspapers  and  maga¬ 
zines  across  the  continent. 

As  the  Report  of  the  American  Association  for 
the  Advancement  of  Science  points  out,  a  minimum  school 
testing  program  should  include  at  Grades  VII  and  IX  a 
scholastic  aptitude  test 

.  .  .  as  little  dependent  on  academic  skills  as  possible. 
This  should  should  be  a  reasoning  test,  based  on  material 
not  directly  taught  in  the  school  ....  It  should  be 
particularly  useful  in  drawing  attention  to  able  pupils 
who  have  not  made  normal  progress  in  verbal  school  work 
and  who,  therefore,  need  remedial  teaching.  (195$) 

A  measuring  instrument  shown  to  be  useful  for  these 

purposes  is  unfortunately  lacking. 

The  potential  usefulness  of  an  intelligence  test 
which  calls  on  a  minimum  of  acquired  skills  can  be  seen 
also  in  the  improved  classification  of  school  pupils 
from  city  slums,  rural  backwoods,  or  foreign  cultures, 
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and  in  the  facilitation  of  the  mental  assessment  of 
sensory-handicapped,  maladjusted,  brain-damaged,  and 
suspected  backward  children.  With  a  culture-reduced 
test,  research  workers  who  attempt  to  hold  intelli¬ 
gence  constant  may  do  so  with  greater  justification, 
especially  if  an  experiment  puts  little  premium  on 
verbal  skills.  In  the  absence  of  valid  aptitude  tests, 
vocational  counsellors  and  employers  in  many  occupa¬ 
tions  without  verbal  emphasis  may  find  culture-reduced 
measures  more  valid  than  verbal  ones.  Neurologists 
would  benefit,  too,  if  a  test  could  be  devised  ’which 
is  more  sensitive  to  changing  mental  processes  after 
brain  operations. 

Since  the  conventional  verbal  intelligence 
tests  appeared  to  be  of  limited  usefulness  for  such 
functions  as  these,  it  was  the  general  purpose  of  the 
present  investigation  to  identify  tests  or  sub-tests 
which  appeared  to  be  promising  instruments  for  estim¬ 
ating  intellectual  ability  with  a  minimum  of  cultural 
bias . 

More  specifically,  the  purpose  of  the  investi¬ 
gator  was  to  identify  tests  or  sub-tests  which,  for  a 
representative  sample  of  271  Grade  VII  Edmonton  chil¬ 
dren  were  valid  measures  of  general  intellectual 
ability,  and  showed  a  minimum  of  socio-economic  bias. 
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The  approach  taken  was  to  evaluate  nine  promising  tests 
and  sub-tests  by  comparison  with  conventional  verbal 
tests  of  intelligence  and  achievement  in  relation  to 
socio-economic  status.  Factor  analytic  techniques  were 
used  to  identify  the  abilities  sampled  by  each  kind  of 
test,  correlations  with  socio-economic  status  at  differ¬ 
ent  ages  were  compared,  item  discrimination  patterns 
were  studied,  and  relationships  with  school  achievement 
were  analyzed  in  order  to  identify  the  kinds  of  items 
and  sub-tests  which  could  be  regarded  as  most  approp¬ 
riate  for  the  general  purpose  of  the  study. 

Thus,  culture-reduction  was  examined  in  relation 
to  socio-economic  variables  only,  on  the  assumption 
that  these  reflect  differences  in  environmental  stimu¬ 
lation  and  acquired  knowledge  sensitively  enough  to 
expose  tests  which  make  it  difficult  for  underprivi¬ 
leged  children  to  reveal  their  potential  ability. 

Whether  the  most  promising  tests  of  this  study  are 
equally  suitable  for  children  from  non-American  or  non- 
urban  cultures  is  a  subject  for  future  research* 
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CHAPTER  II 


REVIEW  OF  RELATED  RESEARCH 


The  Binet  Scale .  Many  studies  from  Binet Ts 
generation  to  the  present  day  have  revealed  a  marked 
positive  relationship  between  the  measured  intelligence 
of  school  children  and  their  socio-economic  background. 
Writing  in  1908,  Binet  recognized  the  fact  that  results 
for  his  measuring  scale  depended: 

First  on  the  intelligence  pure  and  simple;  second,  on 
extra-scholastic  acquisitions  capable  of  being  gained 
precociously;  third,  on  scholastic  acquisitions  made 
at  a  fixed  date;  fourth,  on  acquisitions  relative  to 
language  and  vocabulary,  which  are  at  once  scholastic 
and  extra-scholastic,  depending  partly  upon  the  family 
circumstances.  (Binet,  1916,  p.  259) 

Three  years  later,  he  re-emphasized  these  envir¬ 
onmental  influences  in  his  explanation  of  the  discrep¬ 
ancies  between  his  own  test  results  and  those  of  the 
two  Belgian  educators,  M,  Decroly  and  Idle.  Degand. 
Binet  attributed  the  eighteen-month  difference  in  mean 
mental  age  between  his  own  sample  and  the  Belgian  group 
to  a  difference  of  "social  condition."  The  Parisian 
standardization  sample  was  recruited  almost  exclusively 
from  "working  class"  children,  while  the  Belgian  pupils 
had  parents  who  belonged  to  a  social  class  "in  easy 
circumstances,"  and  who  understood  the  value  of  educa¬ 
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It  is  plain  that  Binet  was  aware  of  social  class 
differences  in  both  physique  and  educational  aptitude. 

It  is  plain,  too,  that  he  would  not  account  for  lower 
class  "inferiorities"  in  terms  of  heredity  alone.  As 
he  pointed  out,  "some  probably  are  acquired  and  result 
from  unavoidable  and  accessory  circumstances,  others 
are  probably  congenital."  (p.  31$)  Nevertheless,  Binet 
continued  to  regard  his  scale  as  a  fair  measure  of 
"intellectual  capacity"  in  spite  of  the  item  analysis 
he  conducted  on  the  1911  scale.  Of  the  thirteen  tests 
which  showed  most  discrimination  between  the  upper  and 
working  class  samples,  six  were  language  items,  while 
four  more  were  factual  items  depending  upon  home  train¬ 
ing.  Significantly,  these  were  predominantly  what  Binet 
classified  as  "acquisitions,"  in  contrast  to  "intelli¬ 
gence  pure  and  simple." 

Binet's  interpretation  of  social  class  differences 
in  terms  of  superior  language  environment  appears  sound 
and  justifiable.  However,  it  appears  that  he  failed  to 
realize  the  importance  of  these  findings  in  relation  to 
the  validity  of  his  intelligence  tests  for  all  segments 
of  the  population.  It  is  unfortunate  that  subsequent 
test  constructors  and  users  have  frequently  shown  a 
similar  lack  of  perception. 

The  Binet  Scale  of  1911  set  the  pattern  for  later 
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measures  of  intellectual  capacity.  Yet  the  Binet  Scale 
was,  and  has  become  even  more  loaded  with  verbal  items  of 
the  kind  which  differentiated  markedly  between  social 
classes.  If  these  tasks  which  discriminate  clearly  between 
social  classes  predominate  over  the  remainder,  and  consti¬ 
tute  a  major  criterion  for  subsequent  test  validation  and 
item  inclusion,  it  is  obvious  that  little  advance  will  be 
made  towards  the  assessment  of  Binet’ s  concept  of  ”intel- 
ligence  pure  and  simple.” 

Later  Studies  on  Socio-Bconomic  Differences .  Subse¬ 
quent  research  studies  with  adaptations  of  the  first  intel¬ 
ligence  scale  in  other  countries  showed  results  similar  to 
Binet’ s.  William  Stern  (1912)  reported  a  twelve  month 
difference  in  mental  age  between  samples  of  German  upper 
and  lower  class  children.  Cyril  Burt  (1922)  found  similar 
results  in  England,  and  high-lighted  the  nature  of  the 
social  class  differences  with  an  item  analysis.  Again  lin¬ 
guistic,  literary  and  ’’higher  culture”  items  were  favoured 
by  upper  class  children,  while  tests  of  practicality,  shrewd¬ 
ness,  and  perception  showed  the  reverse  discrimination. 

Early  American  studies  were  largely  carried  out 
with  the  Stanford  revisions  of  the  Binet  Scale.  For 
these,  Terman,  like  Binet,  used  school  progress  and  age 
increments  as  his  criteria  for  item  inclusion,  and  was 
able  to  show  large  and  persisting  I.Q.  differences  be¬ 
tween  American  children  of  seven  different  occupational 
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classes  (Tennan  and  Merrill,  1937). 

Such  differences  have  commonly  been  reported  in 
research  studies  in  many  parts  of  the  globe.  The  vol¬ 
uminous  reviews  of  these  findings  by  Neff  (193$), 
Loevinger  (1940),  Herrick  (1951),  and  Anastasi  (195$), 
indicate  that  the  I.Q.  difference  between  highest  and 
lowest  occupational  groups  is  usually  about  15  to  25 
points  on  conventional  verbal  tests.  Where  correlation 
techniques  are  used,  the  coefficients  vary  from  .25  to 
.50  depending  on  the  age  and  homogeneity  of  the  popula¬ 
tion  sampled,  on  the  correlation  coefficient  computed, 
and  on  the  kinds  of  functions  measured  under  the  name 
of  intelligence  and  socio-economic  status. 

Causes  of  the  Differences .  Correlations  reveal 
little  about  causes.  Although  psychologists  agree  that 
tested  intelligence  is  related  to  social  status,  there 
is  much  divergence  of  opinion  surrounding  the  reasons 
for  the  relationship.  Of  the  more  prominent  interpre¬ 
tations  in  the  literature,  those  which  place  varying 
emphasis  on  heredity,  environment,  and  test  bias  may 
enlighten  the  issues  involved. 

(a)  Heredity.  Terman  (1940),  for  instance,  has  argued 
for  the  major  importance  of  heredity,  with  strong  evidence 
from  his  Stanford-Binet  standardization  group  (1937)o 

The  mean  I.Q.’s  for  different  occupational  groups 
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remain  relatively  constant  for  all  ages,  whereas  the 
environmentalist  hypothesis  would  presumably  require 
them  to  change,  with  increasing  divergence  at  the 
extremes  of  the  socio-economic  scale. 

Another  psychologist,  Burt,  (1959)  concludes 
after  statistical  analyses  of  twin  studies,  foster  chil¬ 
dren  and  social  class  differences  that  "differences  in 
average  intelligence  exhibited  by  the  different  socio¬ 
economic  classes  in  this  country  at  the  present  time  are 
mainly  (seventy-five  per  cent),  though  not  perhaps  en¬ 
tirely  the  outcome  of  genetic  difference"  (1959  b,  p.  2$). 
He  subsequently  raised  the  proportion  to  eighty-eight  per 
cent  still  in  favour  of  heredity. 

As  a  caution  to  Terman’s  argument,  Anastasi 
(1953j  P.  522)  pointed  out  that  environmental  differ¬ 
ences  may  have  less  opportunity  to  operate  after  chil¬ 
dren  begin  school,  due  to  the  equalizing  influences  of 
compulsory  education.  Moreover,  the  phenomenon  of 
regression  towards  the  mean  is  bound  to  work  against 
the  environmentalist  case  and  thus  make  divergence  of 
extremes  more  difficult  to  obtain.  It  should  be  noted, 
too,  that  different  functions  are  measured  at  different 
ages  by  the  Stanford-Binet  test,  making  age  comparisons 
dubious  at  least.  Even  if  the  psychologist  could  demon¬ 
strate  the  infallibility  of  his  data,  his  opponent  might 


: 

■ 

„  ■ 

. 

c  •  '•••  ■■  ••  •'  ' 

.  i  '■  1  J)'~  ••• ..  vij  .  v 

■ 

...  "  ;  '  '  '■  ~  ■'  .  ■ 

;  .  ;  : 

.  .  ■  .  .  ■  •  ;  :  .  / ..  . 

* 

. 

c  .  : 

■ 

c  ,  ,  !-:•  "  ol;  j.i 

•  ■  -  ■  c 


14 


argue  that  differential  stimulation  in  the  first  two 
years  of  life  could  account  for  the  later  differences 
in  measured  intelligence,  a  fact  strongly  suggested,  by 
HebbTs  work  in  animal  psychology  (1949). 

Burt’s  approach  to  the  statistical  analysis  of 
the  relative  contributions  of  heredity  and  environment 
to  intelligence  quotient  variance  has  been  rejected  by 
Anastasi  as  ’’futile”  and  by  Hebb  as  ’’nonsense”  owing  to 
its  dubious  assumptions  of  the  additive  nature  of  the 
heredity-environment  relationship.  Anastasi  pointed  out 
that  the  contribution  of  any  hereditary  factor  depends 
entirely  upon  the  environmental  factors  within  which  it 
is  allowed  to  operate.  Similarly,  the  effect  of  any 
environmental  influence  will  vary  according  to  the  in¬ 
herited  factors  it  affects.  Furthermore,  Burt’s  depen¬ 
dence  upon  studies  of  identical  twins  reared  apart  is 
open  to  question  on  the  basis  of  small  N’s,  the  possi¬ 
bility  of  sample  bias ,  the  observed  correlation  between 
intelligence  quotient  difference  and  discrepancy  in 
judged  environmental  stimulation,  and  the  fact  that 
identical  twins  reared  together  tend  to  have  more  similar 
experiences  than  fraternal  twins,  who  are  often  of  dif¬ 
ferent  sexes.  This  crucial  line  of  research  necessarily 


awaits  further  evidence 
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(b)  Environment.  The  environmentalist  case  is  well 
represented  by  the  Iowa  school  on  this  continent,  while 
Floud  and  Halsey  have  recently  been  its  most  outspoken 
proponents  in  Great  Britain. 

The  first  group  point  to  studies  of  Skeels  and 
Fillmore  (1937)  who  found  a  progressive  deterioration 
of  I.Q.  with  increasing  age  for  children  in  the  lowest 
occupational  levels.  Liechenstein  and  Brown  (193$) 
produced  similar  results  for  children  from  delinquency 
areas.  Corroborating  these  results  are  such  findings 
as  those  of  Skeels  on  the  effect  of  nursery  school 
attendance  on  the  I.Q.’s  of  orphanage  children,  and 
Skodak  on  the  mental  development  of  children  from  in¬ 
ferior  homes  who  were  placed  in  superior  adoptive  homes. 
As  summarized  by  Skeels  (1940),  these  studies  all  indi¬ 
cate  the  extreme  sensitivity  of  the  I.Q.  to  change  with 
environmental  stimulation. 

Across  the  Atlantic,  Floud  and  Halsey  (195$), 
after  making  an  analysis  of  the  social  class  of  English 
grammar  school  entrants,  endorsed  the  hypothesis  pre¬ 
viously  proposed  by  Neff  (193$)  that  the  distribution 
of  innate  intelligence  among  different  social  classes 
is  approximately  random.  They  did  not  deny  the  exis¬ 
tence  of  innate  differences  in  intelligence,  but 
attributed  the  differences  between  social  classes  to 
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variation  in  environmental  stimulation.  Writing  on  the 
assumption  that  innate  intelligence  is  determined  by  a 
multiplicity  of  genes,  Halsey  (1958)  drew  up  a  genetic 
model,  using  the  available  evidence  concerning  social 
mobility  to  argue  that  variation  in  social  structure 
is  insufficient  to  account  for  the  belief  that  the  re¬ 
lationship  between  intelligence  and  social  status  is 
attributable  to  hereditary  causes.  From  this  consider¬ 
ation  and  from  a  study  of  the  small  reduction  in  the 
variance  of  measured  intelligence  which  results  from 
classifying  children  according  to  social  class  (for 
example,  100  per  cent  to  79  per  cent),  Halsey  was  led 
to  believe 

.  .  .  that  observed  differences  between  social  classes 
in  measured  intelligence  are  more  likely  to  be  explained 
by  environmental  rather  than  genetic  factors.  ...  It 
is  possible  that  class  differences  in  measured  intelli¬ 
gence  may  be  entirely  environmental.  (1958,  p.  27) 

This  belief  has  since  been  re-asserted  by  Halsey  as  a 

challenging  hypothesis  for  hereditarians  to  disprove 

(1959). 

The  Iowa  studies  have  been  roundly  criticized  by 
McNemar  (1940)  and  Goodenough  (1940)  for  the  uncontrolled 
variables,  for  the  use  of  inflated  NTs,  for  correlating 
percentiles  and  for  ignoring  the  effects  of  regression, 
amongst  other  psychometric  sins.  Moreover,  few  subse¬ 
quent  studies  have  been  able  to  substantiate  the  extreme 
Iowa  claims  for  the  modifiability  of  the  intelligence 
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quotient. 

Halsey Ts  hypothesis  has  been  decried  by  such 
critics  as  Conway  (195$)  and  Burt  (1959  b).  The  former 
referred  to  international  findings  on  identical  twins 
reared  apart,  to  the  evidence  of  non-verbal  intelli¬ 
gence  test  class  differences,  and  to  the  accumulation 
of  studies  showing  the  limited  extent  of  intelligence 
quotient  change  with  environmental  stimulation.  Burt 
questioned  Halsey1 s  figures  on  social  mobility  and 
maintained  that  the  tendency  for  -  individuals  of  high 
intelligence  to  rise  to  a  higher  class  has  been  oper¬ 
ative  and  increasing  for  "thirty  generations  or  more.” 
Hence,  he  concluded  that  there  could  well  be  suffic¬ 
ient  differences  in  the  genetic  composition  of  the 
social  classes  to  explain  the  test  results  in  heredi¬ 
tary  terms. 

In  the  light  of  the  foregoing  studies  and 
criticisms  it  is  evident  that  agreement  on  nature- 
nurture  balance  sheets  is  as  far  distant  as  ever. 

A  more  sophisticated  approach  to  the  problem  is  to 
regard  both  heredity  and  environment  as  essential 
influences  on  intelligence,  their  importance  vary¬ 
ing  according  to  the  extent  to  which  one  is  held 
constant  while  the  other  is  varied.  As  Hebb  points 
out,  "to  ask  how  much  heredity  contributes  to  intel- 
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ligence  is  like  asking  how  much  the  width  of  a  field 
contributes  to  its  area,”  (195$,  p.  120).  As  long  as 
investigators  produce  evidence  from  studies  in  which 
the  homogeneity  of  one  or  both  of  the  two  influences 
is  inexadtly  determined,  so  long  will  the  issue  on  the 
relative  importance  of  heredity  or  environment  continue 
to  be  debated. 

(c)  Test  Bias .  A  third  interpretation  of  the  rela¬ 
tionship  between  intelligence  and  social  status, 
proposed  by  a  Chicago  group  of  psychologists,  is  in 
terms  of  cultural  bias  in  the  tests  used.  Havighurst, 
for  instance,  after  reviewing  the  research  of  Warner, 
Davis,  and  others  on  class  differences  in  child  up¬ 
bringing,  vocabulary,  family  life,  and  attitude  to 
education,  concluded  .  .  . 

that  middle-class  and  lower-class  children  bring  to  the 
intelligence  test  situation  widely  disparate  cultural 
experiences,  by  virtue  of  their  social-class  experience, 
and  that  middle-class  children  get  more  out  of  them¬ 
selves  in  the  ordinary  school  test  situation  than  do 
lower-class  children.  (Havighurst,  1951,  p.  21) 

Consequently,  the  conventional  intelligence  test  is 

likely  to  pose  problems  whose  language  and  content  are 

more  familiar  to  some  segments  of  society  than  others. 

Eells  (1951)  has  produced  strong  evidence  to 
support  the  contention  that  the  bias  in  test  items 
exists,  and  that  it  tends  to  favour  upper  middle-class 
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children.  Eight  widely  used  group  intelligence  tests 
were  administered  to  the  nine,  ten,  thirteen,  and 
fourteen  year  old  children  in  a  mid-western  city  and 
the  results  analyzed  in  relation  to  social  class. 

From  this  study  Eells  concluded: 

1.  That  group  verbal  intelligence  tests  show 
a  greater  correlation  with  social  status  than  non¬ 
verbal  tests. 

2.  That  items  which  depend  upon  the  interpre¬ 
tation  and  manipulations  of  verbal  symbols  show  greater 
discrimination  between  social  classes  than  items  using 
picture,  geometric  design,  stylized  drawing  items. 

3.  That  certain  verbal  items  differentiate 
between  social  classes  considerably  more  than  others. 

In  generalizing  about  the  items  which  appeared  to 
favour  upper  and  middle-class  children,  Eells  con¬ 
cluded  that  many  involved  an  "academic  or  bookish 
vocabulary,"  or  "words,  objects,  or  concepts  with 
which  high  status  pupils  probably  have  more  opportu¬ 
nity  for  familiarity  through  home  and  other  non-school 
experiences."  (1951,  p.  54) 

Amongst  his  examples,  Eells  showed  that  an  item 
which  involved  familiarity  with  the  word  "sonata"  was 
answered  correctly  by  seventy-four  per  cent  of  the 
high  status  group,  as  compared  with  twenty-nine  per 
cent  of  the  lower  status  children,  (1951,  p.  306) 


. 

. 

. 

j  '  :  -.L  ■  .  .  * 

.  .  .■  :■ .  /'J  :■  .  ■  I  ■  .  ■  -  ;o  ; 

"  '  I  ■ 

. 

. 

d  . ...  i  ■->'/ -  ul 

<  •  .  J  .  a'-  . .  ■  ■  ■  ■  v  'j. 

;  ^  ■; ■,  :  ■  / o 

<  ■ •  •. 


. 

■  •  -  ,  •  Id';  - 


<  ..  .  V-  .  . 


. 


20 


By  contrast,  an  item  requiring  knowledge  of  cutting 
tools  evoked  the  right  answer  from  seventy-one  per 
cent  of  high  status  and  seventy-four  per  cent  of  low 
status  children.  (1951,  p.  320) 

Further  examples  of  such  discrepancies  in  item 
discrimination  between  social  classes  are  so  common 
in  Eells’  study  that  it  is  difficult  to  ignore  the 
case  which  the  investigation  was  designed  to  demon¬ 
strate. 

Similar  studies  relating  abilities  to  social 
class  have  been  performed  by  Havighurst  and  Janke 
(1944)  in  a  small  city  which  they  called  Midwest.  All 
ten  and  sixteen  year  old  pupils  in  the  community  were 
given  tests  of  verbal  and  non-verbal  intelligence, 
reading,  space,  mazes,  and  mechanical  assembly.  When 
the  results  were  analyzed  in  relation  to  five  social 
classes,  significant  differences  between  classes  ap¬ 
peared  in  the  majority  of  tests,  especially  between 
extreme  classes,  the  notable  exceptions  being  the  mech¬ 
anical  and  spatial  tests  which  showed  little  change 
between  groups. 

A  later  study  on  the  ten  year  olds,  carried  out 
three  years  later  by  Havighurst  and  Breese  (1947) 
investigated  the  relationship  between  Thurstone’s 
Primary  Mental  Abilities  and  social  class.  Once  again 
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the  middle  classes  performed  significantly  better 
than  the  lower,  but  the  size  of  the  relationship 
between  test  results  and  social  status  varied  accord¬ 
ing  to  the  ability  measured.  Thus,  V  and  N  correlated 
.42  and  .32  respectively  with  the  Index  of  Status 
Characteristics,  while  the  corresponding  coefficients 
for  Space,  Reasoning,  and  ^mory  were  .25  .23  and  .21. 
Although  these  coefficients  are  not  significantly  dif¬ 
ferent,  the  authors  attributed  this  to  small  NTs  and 
went  on  to  conclude  that  the  correlations  with  social 
status  are  higher  for  tests  of  those  functions  that 
appear  to  be  favoured  by  a  superior  social  environment. 
The  findings  are  in  line  with  expectation,  but  the 
small  numbers  on  which  this  unique  set  of  studies  is 
based  make  for  caution  in  generalization.  The  average 
size  of  each  social  class  was  only  31.5  pupils,  while 
two  of  the  studies  reported  showed  no  pupils  in  the 
top  two  classes. 

Eells  has  been  taken  to  task  by  Tyler  (1953) 
for  omitting  to  correct  his  correlation  coefficients 
for  attenuation.  When  Tyler  did  this,  the  differences 
in  correlations  with  social  class  between  verbal  and 
non-verbal  tests  were  reduced  to  .03,  which  is  not  sig¬ 
nificant.  In  fairness  to  Eells,  it  should  be  pointed 
out  that  the  reliability  coefficients  Tyler  used  for 
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correction  were  calculated  by  different  methods  and 
therefore  incomparable. 

Another  critic,  Lorge  (1952)  has  pointed  out 
that  item  analyses  in  terms  of  social  levels  reveal 
nothing  about  the  ability  of  each  item  to  differen¬ 
tiate  between  intellectual  levels.  Irregular  patterns 
of  item  discrimination  may,  then,  be  due  to  the  fact 
that  some  items  do  not  tap  intellectual  ability  as  well 
as  others. 

Lorge  suggests  that  more  logical  ways  of  evalua¬ 
ting  items  from  Eells’  position  would  be  to  adopt  one  of 
the  following  approaches. 

1.  He  might  perform  a  factor  analysis  on  the 
test  results,  calculate  TTgn  and  ”wu  (social  status) 
scores  for  each  individual  and  then  estimate  the  corre¬ 
lation  between  success  on  each  item  and  "g”  score,  inde¬ 
pendent  of  "w". 

20  He  might  partial  out  social  status  in  an 
attempt  to  obtain  for  each  individual  on  each  test  a 
score  which  is  independent  of  social  status. 

Whatever  the  shortcomings  of  Eells'  study,  it  is 
apparent  that  environmental  factors  are  more  important 
in  some  intelligence  tests  than  others.  This  general¬ 
ization  implies : 

1.  that  test  users  should  be  more  cautious  in 
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their  interpretation  of  results  on  children  from  low 
socio-economic  levels  and 

2.  that  attention  should  be  given  to  the  con¬ 
struction  of  intelligence  tests  as  free  as  possible 
from  cultural  influences. 

Culture-Free  and  Culture-Fair  Tests.  The  earlier 
summaries  on  the  effects  of  culture  on  intelligence 
quotient  scores  led  to  various  attempts  to  produce 
what  cane  to  be  known  as  "Culture-Free ff  tests.  Cattell 
(1940),  for  instance,  argued  that  conventional  verbal 
intelligence  tests  can  shed  very  little  light  on  such 
controversial  issues  as  the  comparative  mental  ability 
of  individuals  from  different  races,  classes,  or  gener¬ 
ations  . 

It  is  logically  possible  for  Neff  to  argue  that 
there  are  no  social  status  differences  in  innate  intel¬ 
ligence,  or  for  Klineberg  to  argue  that  even  the  most 
biologically  distant  racial  groups  do  not  differ  in 
average  native  ability,  in  the  face  of  the  general 
sense  of  all  the  direct  and  indirect  evidence  to  the 
contrary.  (1940,  p.  162) 

In  his  attempts  to  remedy  this  situation,  Cattell 
devised  a  "Culture-Free"  test  of  "g"  (1944),  in  three 
scales  to  cover  all  levels  from  four  years  of  age  to 
superior  adult.  The  original  test  was  non-verbal  and 
composed  entirely  of  perceptual  items  which  previous 
investigators  had  shown  to  be  promising  for  the  purpose. 
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In  the  revised  form,  there  are  four  sub-tests  named 
Series,  Classification,  Matrices,  and  Conditions  and 
the  reasoning  required  is  all  performed  through  a  med¬ 
ium  of  abstract  patterns,  which  Cattell  assumed  would 
be  as  unfamiliar  for  one  cultural  group  as  for  another. 

Cattell  did  not  overlook  the  possibility  of 
using  equally  familiar  test  materials  in  his  culture- 
free  test.  Indeed,  he  listed  a  large  number  of  objects 
and  processes  which  are  sufficiently  common  to  all 
cultures  to  form  a  nucleus  for  such  a  test  (1940). 
However,  he  admitted  the  force  of  the  objection  raised 
by  cultural  anthropologists  that  the  same  objects  are 
invested  with  different  meanings  by  different  cultural 
groups.  Consequently,  his  own  approach  lay  through  the 
perceptual  item  which  had  been  shown  to  have  high  load¬ 
ings  on  Spearman’s  ”g”  factor  (Cattell,  1940)  and  no 
other,  while  ’’apperceptual  associations”  were  reduced 
to  a  minimum.  To  answer  the  criticism  that  his  test 
modality  was  too  narrow  to  sample  a  complex  mental 
function  adequately,  Cattell  called  on  Spearman’s  ”Theo 
rem  of  Indifference  of  Indicator,”  which  arose  from  the 
demonstrated  fact  that  the  ”g”  factor  can  be  soundly 
measured  by  tests  from  any  field,  no  matter  how  speci¬ 
fic  (Spearman,  1932,  p„  197). 

It  was  left  to  Davis  and  Eells  to  take  the 
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"common  knowledge"  approach  to  the  problem  of  culture- 
fair  tests.  As  a  follow-up  project  to  their  study  on 
cultural  differences  in  intelligence,  they  constructed 
in  1953  a  test  known  as  the  Davis-Eells  Games.  This 
test  requires  no  reading,  all  instructions  being  given 
orally.  Like  the  Cattell  test,  it  is  non-verbal,  but 
it  depends  upon  pictorial  rather  than  abstract  items. 
The  essential  aim  of  the  test  designers  was  to  find 
problems  chosen  from  every-day  life  experiences  of 
children  from  all  social  classes  within  an  urban  Amer¬ 
ican  culture  context.  The  test  items  selected  for 
initial  experimentation  were  based  on 

.  .  .  the  insights  of  a  number  of  educators  and  socio¬ 
logists  familiar  with  the  characteristic  modes  of 
living  and  child  upbringing  at  different  socio-economic 
levels,  and  in  part  upon  systematic  observation  of 
children  in  free-time  activities  (on  playgrounds,  in 
neighbourhood  groups,  in  schools  .  .  .)  (Davis  and 
Sells,  1953,  p.  IS). 

Evaluation o  Many  investigators  have  attempted 
the  difficult  task  of  evaluating  these  culture-fair 
tests.  The  usual  procedures  of  correlation  with  exist¬ 
ing  instruments  or  educational  criteria  are  of  little 
help,  since  they  are  culturally  weighted  to  begin  with. 
A  low  correlation,  however,  would  be  cause  for  concern 
when  a  homogeneous  social  class  group  is  used.  The 
usual  approach  has  been  in  terms  of  comparisons  with 
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other  tests  in  relation  to  social  class  or  foreign 
background.  Other  possible  criteria  are  long-term 
prediction  value  and  susceptibility  to  practice  effects. 

The  few  studies  on  Cattell’s  test  of  "g"  have 
usually  shown  good  reliability  (for  example,  Anastasi 
and  Cordova,  1953)  and  g  loadings  in  batteries  of  other 
ability  and  achievement  tests.  Cattell  (195$)  quoted 
several  g  loadings  around  .  $  in  French  and  American 
immigrant  samples,  while  correlations  with  other  intel¬ 
ligence  tests  such  as  the  Stanford-Binet  and  Otis  were 
high  enough  (.6  to  .7)  to  assume  that  an  important 
aspect  of  intelligence  was  being  sampled.  Drake  (1955) 
indicated  that  its  correlation  with  school  achievement 
is  as  high  as  that  of  the  Otis,  and  suggested  that  the 
test  has  considerable  potential  for  "equating  all  sub¬ 
jects  for  past  subject-matter-training  effects." 

Marquart  and  Bailey  (1955)  showed  that  the 
Cattell  Test  differentiated  significantly  less  than  the 
Stanford-Binet  between  social  classes  at  the  Scale  II 
level  (ages  eight  to  fifteen),  but  Fowler  (1955)  was 
unable  to  find  such  a  difference  when  he  compared  the 
Cattell  Test  with  the  Henmon-Nelson  and  Detroit  Alpha 
Tests.  Cattell,  Feingold,  and  Sarason  (1941)  demon¬ 
strated  that  of  four  tests  administered  to  newly  arrived 
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American  immigrants  ,  the  Cattell  Test  was  less  sensi¬ 
tive  than  verbal  measures  to  the  cultural  influences 
to  which  the  immigrants  were  exposed  in  their  first 
three  months  in  America.  Cattell  (1958)  quoted  another 
study  of  a  complete  ten-year-old  cross  section  of  a 
city  of  300,000  with  the  Culture-Free  Test,  which 
showed  that  neither  mean  nor  standard  deviation  had 
changed  over  a  thirteen-year  period.  This  finding 
puts  the  test  in  a  favourable  light  in  comparison 
with  the  Stanf ord-Binet ,  which  shifted  its  Scottish 
norms  2.28  intelligence  quotient  points  over  a  com¬ 
parable  period,  presumably  because  of  cultural  and 
educational  change  (Cattell,  1958).  The  Army  Alpha 
Test  showed  an  even  greater  shift  between  World  War  I 
and  II.  Another  promising  finding  is  that  of  Fowler 
(1955),  who  found  no  sex  differences  in  the  Cattell 
Test  on  a  large  urban  sample  which  had  significant  sex 
differences  on  the  Henmon-Nelson  and  Detroit  Alpha  Tests. 

Turning  to  international  comparisons,  the  few 
shreds  of  evidence  are  still  inconclusive.  Anastasi 
and  Cordova  (1953)  found  that  median  intelligence  quo¬ 
tients  of  Puerto  Rican  children  in  New  York  were  con¬ 
siderably  lower  than  Cattell Ts  norms,  but  attribute  the 
difference  to  low  socio-economic  status  and  bilinguali-sm 
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Drake  (1955)  reported  a  study  done  by  Edwin  Harper  in 
Allahabad,  India.  In  this  study,  the  mean  of  fifty-six 
Indian  college  students  on  the  Culture-Free  test  was 
equivalent  to  CattellTs  tenth  percentile,  and  many  stu¬ 
dents  apparently  ran  out  of  time. 

Klineberg  (1954)  emphasized  the  difficulties 
involved  in  such  cross-cultural  comparisons  by  point¬ 
ing  out  that  among  some  Indian  tribes  it  is  bad  form 
to  answer  a  question  in  the  presence  of  someone  else 
who  does  not  know  the  answer,  while  for  some  less  civi¬ 
lized  peoples,  speed  of  work  is  a  very  poor  motivator. 

To  reduce  such  difficulties,  Cattell  (1953) 
suggested  the  abolition  of  a  time  limit  for  cross- 
cultural  studies  and  individual  testing  to  minimize 
the  "co-operation  versus  competition"  conflicts.  These 
devices  may  reduce  the  problems  involved  but  demonstra¬ 
tion  of  their  effectiveness  awaits  further  research. 

On  the  whole,  there  are  enough  promising  results  repor¬ 
ted  to  warrant  further  study  of  the  test. 

The  Davis-Eells  Games  has  generally  met  with  less 
success  even  when  evaluated  in  terms  of  the  constructors 
purposes.  Smith  (1956)  and  Stillwell  (1953)  showed  that 
it  differentiated  between  social  classes  just  as  much  as 
the  California  Test  of  Mental  Maturity,  one  of  the  cul¬ 
ture-loaded  tests  of  the  Eells’  study  (1951).  Love  and 
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Beach  (1957)  showed  similar  results  when  the  Kuhlmann- 
Anderson  Test  was  used  for  comparison,  and  pointed  to 
disadvantages  in  what  the  author  called  emotional 
loading  of  several  items  and  the  importance  of  small 
pictorial  details. 

Burt  (1959  a)  pointed  out  that  the  author’s  def¬ 
inition  of  problem  solving  capacity  is  too  vague  to 
permit  evaluation  of  face  validity,  but  added  that 
culture-fairness  is  scarcely  indicated  when  familiarity 
with  the  conventions  of  the  strip  cartoon  are  necessary 
for  understanding  some  of  the  items.  The  relaxed  atmos¬ 
phere  in  which  the  test  is  administered  may  well  affect 
the  reliability,  which  is  relatively  low  for  a  ninety- 
minute  test.  Geist  (1954)  showed  a  discouraging  zero 
correlation  with  the  Stanford-Binet  on  an  upper  social 
class  sample  of  California  children  in  Grades  II  to  VI, 
while  Guilford  (195&),  in  the  absence  of  empirical 
evidence  to  the  contrary,  asserted  that  the  test  measures 
social  intelligence  and  acquired  information. 

Fowler  (1955)  showed  one  encouraging  result  for 
the  test  in  the  form  of  a  significantly  lower  correla¬ 
tion  with  social  status  when  compared  with  five  other 
intelligence  tests  on  an  American  urban  sample  of  201 
children.  When  Negro  and  Polish  children  were  added, 
however,  the  difference  in  correlations  was  insigni- 
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ficant.  By  and  large,  the  many  comparative  studies  done 
in  relation  to  social  status  suggest  that  the  authors’ 
purpose  has  not  been  achieved.  The  need  for  further 
attempts  in  the  same  field  are  emphasized,  however, 
even  by  those  most  critical  of  the  present  test. 

In  conclusion,  it  appears  that  Cattell’s  approach 
to  the  problem  of  reducing  cultural  differentials  by 
holding  past  learning  to  a  minimum  is  more  promising 
than  the  Davis-Eells’  attempt  to  hold  it  constant.  On 
this  assumption,  the  present  study  has  been  designed 
to  study  tests  similar  to  Cattell’s,  in  an  attempt  to 
identify  problems  which  tap  fundamental  processes  of 
intellectual  ability  while  holding  environmental 
influences  to  a  minimum. 

CONCLUSIONS 

lo  Large  differences  have  been  observed  in  the 
intelligence  scores  of  children  from  different  social 
strata. 

2.  These  differences  have  been  attributed  chiefly 

to  variation  in  genetic  equipment,  environmental  stimulation, 
and  test  bias ,  although  such  factors  as  motivation  and  test 
sophistication  have  been  shown  to  be  important. 

3.  Verbal  intelligence  tests  show  greater  differen¬ 
tiation  between  social  classes  than  non-verbal  reasoning 
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4.  An  intelligence  test  designed  to  reduce 
culture  differences  by  holding  them  constant  appears 
less  promising  than  one  designed  to  eliminate  or 
minimize  them. 

5.  If  the  use  of  such  a  test  could  be  popula¬ 
rized,  it  would  be  valuable  for 

a.  Identifying  talented  students  from  under¬ 
privileged  backgrounds. 

b.  Classifying  pupils  from  foreign  language 
backgrounds  for  appropriate  treatment. 

c.  Predicting  to  long-term  and  non-specific 
criteria,  particularly  when  change  in  treatments  is 
possible » 

d.  Improving  cross-cultural  comparisons. 

e.  Distinguishing  between  mental  defect  and 
reading  difficulties  of  a  sensory  or  emotional  nature. 

f.  Holding  intelligence  constant  in  research 
experiments  where  verbal  ability  is  irrelevant;  for 
example,  motor  learning. 
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CHAPTER  III 


THE  CONCEPT  OF  INTELLIGENCE 

The  Problem  of  Definition.  It  is  well  known  that 
psychologists  have  been  unable  to  agree  on  a  definition 
of  the  term  "intelligence."  Typical  of  the  meanings 
with  which  the  construct  has  been  invested  by  leading 
workers  in  the  field  are  those  which  emphasize  good 
judgment  (Binet,  1916);  abstract  thinking,  (Terman,  1921); 
adaptation  (Stern,  1912);  learning  ability  (Freeman, 

1940);  and  eduction  of  relations  and  correlates  (Spear¬ 
man,  1923).  Non-intellective  factors  are  included  in 
the  definitions  of  Wechsler  (1959),  Stoddard  (1942), 
and  psychologists  who  define  the  concept  as  "that  which 
intelligence  tests  measure"  (for  example,  Spiker  and 
McCandless,  1954)*  Burt  (1955)  on  the  other  hand,  re¬ 
stricts  the  meaning  of  the  construct  to  the  cognitive 
sphere,  and  insists  on  its  innateness  and  generality, 
while  Thorndike  (1925)  believed  that  intelligence  can 
be  defined  only  in  terms  of  physiology.  The  majority 
of  the  earliest  writers  have  largely  assumed  the  unitary 
nature  of  intelligence,  but  recent  research  has  empha¬ 
sized  the  multiple  factor  approach,  while  some  psycho¬ 
logists  have  denied  the  usefulness  of  a  general  factor. 


Hebb  (1949)  has  attempted  to  reduce  the  confu¬ 
sion  by  distinguishing  between  Intelligence  A  and 
Intelligence  B.  The  former  he  regarded  as  hereditary 
potential  which  is  determined  by  the  condition  of  one’s 
brain  and  nervous  system.  The  latter  indicates  the 
present  level  of  intellectual  functioning,  as  the  pro¬ 
duct  of  the  interaction  of  Intelligence  A  and  past 
experience.  Intelligence  A  can  be  estimated  but  not 
measured  because  no  two  individuals  are  subjected  to 
identical  environmental  stimulation.  Vernon  (I960) 
carried  the  analysis  one  step  further  by  adding  Intel¬ 
ligence  C,  the  estimate  obtained  from  one  intelligence 
test.  Such  distinctions  may  serve  as  useful  reminders 
that  single  estimates  of  Intelligence  B  are  unreliable, 
and  of  Intelligence  A  are  even  more  so.  Nevertheless, 
they  are  frequently  overlooked  in  psychological  and 
popular  discussion  and  argument  about  the  nature  of 
intelligence  continues 0  Such  widespread  disagreement 
indicates  the  importance  of  a  clear  description  of  the 
concept  of  intelligence  for  the  purpose  of  this  inves¬ 
tigation. 

It  should  be  recognized  that  intelligence  is 
primarily  an  intervening  variable  inferred  from  certain 
behavioural  acts  of  human  beings.  It  is  a  characteristic 
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not  of  a  person,  but  of  his  behaviour.  therefore,  it 
is  meaningless  to  assert  that  one  individual  possesses 
more  of  it  than  another;  rather,  he  acts  intelligently 
more  often,  or  more  consistently.  Intelligence  has 
never  been  observed,  and  as  Slater  points  out,  "if  any 
two  people  make  contrary  statements  about  it,  it  cannot 
be  examined  to  see  which  of  the  statements  is  correct." 
(194$,  p.  20) 

If  the  criterion  of  correspondence  with  objective 
reality  cannot  be  applied  to  the  concept  to  refine  its 
meaning,  then  we  must  invoke  other  criteria.  A  more 
popular  standard  has  been  that  of  usefulness  for  prac¬ 
tical  prediction  to  various  kinds  of  criteria,  and  for 
the  theoretical  purpose  of  explaining  individual  and 
group  differences  in  test  scores <> 

As  more  evidence  comes  to  hand,  the  construct 
will,  no  doubt,  have  to  be  revised  or  replaced  by  an¬ 
other  aet  of  concepts  at  a  higher  level  of  explanation. 

In  such  a  way  were  the  concepts  of  physics  refined  from 
the  time  of  Gallileo  and  Kepler,  through  Newton  to  Ein¬ 
stein.  However,  as  long  as  the  construct  of  intelligence 
has  classificatory  and  predictive  value,  regardless  of 
its  level  of  explanation,  it  will  continue  to  be  used 
by  psychologists. 
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Intelligence  Regarded  Statistically,  When  the 
pioneer  mental  tests  of  Binet  and  Spearman  showed  that 
correlation  rather  than  compensation  was  a  character¬ 
istic  of  superior  performance  in  a  wide  variety  of  what 
our  culture  regards  as  intellectual  tasks,  the  variable 
"intelligence”  was  postulated  to  account  for  the  major 
portion  of  the  variance  in  the  test  results.  Indivi¬ 
duals  were  shown  to  differ  in  such  a  consistent  pattern 
that  a  global  construct  of  general  intelligence  appeared 
meaningful.  Presumably,  if  positive  correlations  had 
not  been  observed  between  the  majority  of  mental  tests, 
the  term  intelligence  would  have  been  superfluous,  and 
narrower  abilities  may  well  have  been  substituted  for 
it.  Indeed,  many  factor  analysts  (for  example,  Guil¬ 
ford,  1956)  impressed  with  the  multifarious  dimensions 
of  mental  organization,  have  already  questioned  the 
usefulness  of  the  concept  as  a  general  one.  However, 
its  statistical  refinement  into  numerous  dimensions  has 
not  destroyed  its  usefulness  as  a  unitary  concept  in 
long-term  prediction  to  non-specific  and  changing  crit¬ 
eria,  especially  in  heterogeneous  populations  of  school 
children. 

Binet’s  own  interpretation  of  the  concept  departed 
from  the  level  of  behavioural  referents  when  he  distin- 
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guished  between  "intelligence  pure  and  simple"  and  the 
other  characteristics  which  his  test  measured.  (1916) 
Apparently,  he  depended  on  introspection  for  his  des¬ 
cription  of  intelligence,  for  it  had  first  to  be 
stripped  of  irrelevant  learnings  and  "scholastic  acqui¬ 
sitions,"  which  he  regarded  as  necessary  media  for  an 
attempted  estimate  of  intelligence.  Binet  was  making 
the  first  crude  distinction  between  Intelligence  A  and 
B.  Later  psychologists  differed  over  the  significance 
of  the  distinction.  Some  leaned  to  an  operational 
approach,  with  the  implied  inclusion  of  the  non-cogni- 
tive  and  acquired  characteristics  which  the  test 
instruments  sampled,  while  others  sought  to  isolate 
the  construct  mathematically  by  reducing  the  importance 
of  the  test  medium,  and  by  concentrating  on  the  cogni¬ 
tive  source  of  the  test  variance. 

Spearman1 s  "g".  An  early  "purist"  who  requires 
consideration  was  Spearman,  who  pioneered  the  factor- 
analytic  movement  of  modern  psychometric  research. 
Spearman  received  credit  for  noticing  the  hierarchical 
nature  of  many  intercorrelation  matrices  of  ability 
tests,  and  devising  the  tetrad  equations  which  enabled 
him  to  demonstrate  the  presence  of  the  general  factor, 
an  important  statistical  concept  which  could  account 
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for  the  majority  of  the  variance  of  many  test  results 
in  the  cognitive  domain.  Impressed  by  the  frequent 
recurrence  of  this  general  construct,  he  proposed  his 
two-factor  theory  to  explain  the  positive  correlation 
between  mental  abilities. 

Thus,  all  mental  problems  were  said  to  call  on 
the  activity  of  two  hypothetical  factors.  One  of  these 
is  common  to  the  majority  of  cognitive  tasks,  and  being 
general  in  nature  is  called  "g".  The  other,  the  ,TsIT 
factor,  is  specific  to  the  task  involved  and  can  be  re¬ 
garded  as  independent  of  g  and  of  all  other  sTs. 
Reflecting  upon  the  nature  of  the  functions  which  tests 
of  consistently  high  g  loadings  sampled,  Spearman  con¬ 
cluded,  by  means  of  J.  S.  Mills  logic  of  " agreement  and 
difference,”  that  two  of  the  essential  characteristics 
of  g  behaviour  are  "the  eduction  of  relations  and  cor¬ 
relates."  Thus,  an  arithmetic  problem  which  calls  upon 
a  pupil  to  discover  the  significant  relations  of  the 
data  given,  and  to  produce  something  new  with  the  same 
relations  is  a  task  more  highly  loaded  with  Spearmans 
g  than  an  arithmetic  problem  which  calls  only  upon 
knowledge  of  mutiplication  tables.  Spearman  concluded 
that  a  task  involves  the  general  factor  in  the  same 
proportion  as  it  involves  the  discovery  of  relations 
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and  the  eduction  of  correlates. 

Although  he  never  completely  identified  g  with 
intelligence  in  view  of  the  "equivocality”  of  the  latter 
term,  he  did  state  that  these  two  principles  essential 
to  g  "most  conspicuously  deserve  the  name  of  intelligence" 
(Spearman,  1923,  p.  352).  Moreover,  his  intelligence 
tests  were  designed  to  maximize  the  g  loadings. 

Certainly,  there  is  plausibility  in  this  position, 
when  it  is  considered  that  factor  analysis,  which  sorts 
abilities  into  homogeneous  groups,  shows  that  g,  or  its 
approximation,  is  the  common  denominator  of  a  large  var¬ 
iety  of  mental  tasks.  Whatever  intelligent  behaviour  is, 
it  certainly  should  be  extensively  tapped  by  a  broad 
sample  of  mental  tests.  To  support  Spearman1 s  position, 
Brown  and  Stephenson  (1933)  were  able  to  show  that  it  is 
possible  to  extend  a  battery  of  three  tests  which  psycho¬ 
logical  judgment  deemed  to  be  "intelligence  tests"  to, 
the  point  where  twenty  heterogeneous  instruments  were  in¬ 
cluded,  all  of  which  produced  tetrad  differences  near 
zero,  within  the  limits  of  sampling  error.  It  is  un¬ 
likely  that  this  could  have  been  accomplished  unless 
the  initial  trio  were  tests  of  functions  other  than  a 
very  general  ability. 

History  has  tended  to  condemn  Spearman  for  his 
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neglect  of  group  factors,  while  praising  him  for  the 
discovery  of  ,Tg".  It  is  true  that  Spearman  minimized 
the  importance  of  primary  abilities  of  less  generality 
than  g,  but  it  should  be  noted  that  he  did  not  deny 
them  entirely;  in  fact,  he  devoted  the  latter  part  of 
his  last  book  (1950)  to  their  description*  Thurstone, 
who  disagreed  with  Spearman  over  the  importance  of  the 
general  component,  showed  the  possibility  of  compromise 
when  he  demonstrated  the  presence  of  a  second  order 
factor  amongst  his  overlapping  primary  abilities  (1947). 
That  Spearman1 s  g  and  ThurstoneTs  second  order  factor 
could  be  regarded  as  similar  in  nature  has  been  demon¬ 
strated  by  such  research  workers  as  Rimoldi  (194$), 
Schmidt  (1957),  Bernyer  (1959),  and  others.  It  is  to 
be  regretted  that  the  initial  disagreements  shown  by 
these  two  pioneers  of  research  could  not  be  tracked 
down  earlier  to  differences  in  the  size  and  homogeneity 
of  the  population  sampled,  and  to  the  particular  factor 
analytic  procedures  employed. 

On  the  credit  side  of  Spearman* s  legacy,  the  pos¬ 
tulation  of  g,  both  as  a  statistical  quantity  and  a 
psychological  attribute,  has  been  widely  accepted.  The 
British  school  of  psychologists  have  adopted  it  almost 
unanimously  as  a  statistical  construct  which,  in  broad 
samples  of  mental  tests,  accounts  for  the  major  portion 
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of  the  variance.  Thus,  Vernon  (1950)  showed  in  each  of 
eight  separate  analyses  of  varied  ability  tests  on  un¬ 
selected  samples  of  armed  services  recruits  that  the  g 
factor  covered  more  than  twice  as  much  variance  as  all 
other  factors  combined.  Thomson  agrees  that  .  .  .  "all 
abilities  involve  more  or  less  g,T  (1951,  p.  58)  but 
prefers  to  restrict  its  interpretation  to  mathematical 
terms  only.  Burt  points  out  that  the  general  factor  in 
analyses  of  ability  tests,  whatever  its  label  (g  or 
intelligence)  is  most  valuable  for  predicting  efficiency 
in  concrete  tasks,  for  .  .  .  "all  types  of  intellectual 
achievement  are  correlated  with  it  in  varying  degrees" 
(1959  c). 

In  America,  Spearman’s  g  has  also  been  widely 
adopted.  Terman,  whose  influence  on  our  concepts  of 
intelligence  is  probably  unparalleled,  made  the  follow¬ 
ing  statement  in  a  public  address  before  his  death: 

I  am  convinced,  that  to  achieve  greatly  in  almost 
any  field,  special  talents  have  to  be  backed  up  by  a  lot 
of  Spearman’s  g,  by  which  is  meant  the  kind  of  general 
intelligence  that  requires  ability  to  form  many  sharply 
defined  concepts,  to  manipulate  them  and  to  perceive 
subtle  relationships  between  them.  (1954,  p.  224). 

Apparently  Terman’ s  concept  of  intelligence  was  very 

similar  to  that  of  Spearman. 

Wechsler,  another  American  psychologist,  claims 
that  he  is  still  an  adherent  of  Spearman’ s  but  insists 
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that  there  is  more  to  intelligence  than  g.  (195$) 

Cattell,  too,  has  adopted  Spearman’s  g  as  a  useful 
construct  in  his  verbal  and  non-verbal  tests  of  intel¬ 
ligence  (1943)*  Since  the  tests  of  these  psychologists 
have  proven  their  usefulness,  and  have  contributed  much 
to  current  behavioural  conceptions  of  intelligence,  the 
concept  of  g,  around  which  they  were  designed,  should 
be  closely  considered  in  studies  of  intelligence  test 
validity. 

Group  Factors  of  Ability.  As  early  as  1917, 

Burt  produced  evidence  to  suggest  that  variables  other 
than  a  general  factor  must  be  postulated  to  explain  the 
correlations  among  mental  tests  on  large  groups  of  school 
children.  The  major  factors  to  appear  in  this  study, 
apart  from  g,  were  verbal,  numerical,  and  practical. 
(Vernon,  1950).  Kelley  corroborated  his  results  in  a 
192$  study,  adding  memory,  spatial,  and  speed  factors 
to  the  list. 

Thurstone,  however,  revealed  no  g  in  his  first 
major  analyses  of  the  abilities  of  college  students 
(193$).  Instead,  he  showed  that  his  eight  primary  fac¬ 
tors  could  reproduce  the  original  correlation  matrix. 
Nevertheless,  Eysenck,  in  a  re-analysis  (1939),  showed 
that  even  in  such  a  relatively  select  group  as  Thurstone 
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worked  with,  the  g  factor  could  be  identified  and  that 
it  actually  accounted  for  more  variance  than  all  the 
group  factors  combined 0  Thus,  Thurstone  had  not  elim¬ 
inated  g  from  the  domain;  he  had  spread  it  amongst  his 
eight  factors  in  a  method  of  analysis  which  is  just  as 
legitimate  as  that  used  by  the  British  workers.  The 
resolution  of  this  difference  of  interpretation  could 
be  decided,  then,  only  on  the  grounds  of  psychological 
meaning  and  usefulness. 

Later  American  studies  by  Thurstone,  Guilford, 
and  their  respective  colleagues  revealed  many  more 
group  factors  of  ability,  both  broad  and  narrow,  their 
appearance  depending  chiefly  upon  the  homogeneity  of 
the  population  sampled,  and  on  the  kinds  of  tests  used 
in  the  battery.  'Thurstone  (1947)  >  demonstrated  the 
correlation  of  his  primary  abilities,  while  Crawford 
(1947)  showed  that  the  average  g  loading  for  the  tests 
in  Thurstone’ s  eighth  grade  studies  was  over  .5.  Such 
findings  made  compromise  with  the  English  analysts  fea¬ 
sible  although  the  personal  flavour  of  the  conflict- 
made  the  chief  protagonists  reluctant  to  admit  it. 
Nevertheless,  the  reconciliation  position  advocated 
by  Burt,  and  well  described  by  Vernon  (1950)  has  been 
widely  accepted.  It  is  this  position  which  is  proposed 


. 

, 

' 

t  ■ 

. 

o  ■ 

.  '  ,  :  ■  V  • 

„ 

,  .l 


1  .  < 


.Vs-:-) 

.  .  ‘  '  ■  -l-  *  .  J 

'J  > 

,  •  '  .V  ; 


43 


by  the  present  writer,  as  a  reasonable  construct 
around  which  to  build  up  a  "nomological  network"  for 
the  concept  of  intelligence. 

Hierarchy  of  Factors .  In  the  discussion  of  the 
concept  of  intelligence,  it  was  pointed  out  that  the 
construct  is  a  hypothetical  one  of  controversial  mean¬ 
ing.  Statistically,  it  is  regarded  by  the  present 
writer  as  a  covering  term  for  a  hierarchy  of  factors, 
arranged  in  descending  ondter  of  prominence  according 
to  their  appearance  in  factor  analytic  studies  of  men¬ 
tal  ability,  i'hese  factors  are  primarily  statistical 
constructs,  having  a  possible  but  not  a  necessary 
correspondence  with  real  powers  or  faculties.  An  indi¬ 
vidual  acts  in  a  problem  solving  situation  as  if  his 
behaviour  were  determined  by  the  factors  of  the  hier¬ 
archy  in  varying  proportions. 

At  the  top  of  the  hierarchy  is  a  general 
intellectual  factor,  comparable  to  Spearman’s  g  or  the 
second-order  factor  of  an  oblique  analysis,  and  sampled 
extensively  by  most  items  in  general  ability  tests.  It 
can  thus  be  viewed  as  a  common  denominator,  or  under¬ 
lying  dimension  which  accounts  for  a  major  portion  of 
the  variance  found  in  a  large  variety  of  complex 
intellectual  tasks,  as  defined  in  our  culture. 
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At  the  second  level  are  found  the  major  group 
factors  which  are  invoked  to  explain  variance  over  and 
above  g.  Most  frequently  encountered  in  research  have 
been  verbal,  numerical,  spatial,  and  mechanical  factors; 
but  non-cognitive  dimensions  such  as  persistence,  speed, 
industry,  difficulty,  and  schooling  may  well  be  dis¬ 
covered  at  this  and  lower  stages 0  The  inclusion  of 
such  factors  in  a  concept  of  measured  intelligence 
appears  meaningful,  as  long  as  they  influence  the  res¬ 
ults  obtained  on  ability  tests » 

Lower  levels  of  the  hierarchy  can  be  more  clearly 
identified  when  relatively  homogeneous  tests  and  popula¬ 
tions  are  used.  However,  the  conditions  imposed  on  their 
appearance  make  them  relatively  less  variant,  and  their 
usefulness  is  greatest  in  the  construction  of  instrum¬ 
ents  designed  for  predicting  to  very  specific  criteria,, 
Guilford  has  indicated  the  tremendous  diversity  of  men¬ 
tal  structure  in  his  recent  research  bn  high  level 
personnel  (1956). 

Such  a  hierarchical  picture  must  be  regarded 
only  as  a  tentative  hypothesis  rather  than  an  estab¬ 
lished  conclusion,  since  it  depends  upon  the  nature  of 
the  tests  used,  and  the  population  sampled.  Support 
for  its  postulation  is  found  in  the  frequent  recurrence 
of  a  g  and  the  major  group  factors,  whenever  a  hetero- 
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geneous  population  is  tested  with  a  broad  variety  of 
ability  tests.  In  Vernon’s  words,  it  is  "a  hypothet¬ 
ical  integration  of  all  the  factorial  investigations 
that  have  been  carried  out.”  (1950,  p.  26) 

The  amount  of  variance  which  each  factor  may 
account  for  will  differ  somewhat  with  each  analysis, 
and  no  absolute  lines  can  be  drawn  between  broad  and 
narrow  group  factors.  However,  it  is  obvious  that 
tests  which  add  less  than  two  or  three  per  cent  to 
the  variance  of  an  ability  in  a  multiple  prediction 
situation,  are  of  limited  practical  value.  Since  g 
commonly  accounts  for  fifty  to  seventy  per  cent  of  the 
common  variance  in  heterogeneous  samples  of  elementary 
school  children,  while  group  factors  contribute  less 
than  half  this  amount,  it  is  clear  that  a  test  highly 
saturated  with  g,  with  negligible  group  factor  loadings, 
may  well  serve  as  a  useful  predictor  to  a  non-specific 
criterion,  such  as  university  entrance  or  special  class 
schooling. 

Since  the  prominence  of  the  general  factor  has 
frequently  been  less  marked  with  older  children  and 
adults,  the  hypothesis  of  differentiation  of  general 
ability  with  age  has  been  widely  entertained  (Garrett, 
1946).  However,  much  of  the  difference  can  be  explained 
in  terms  of  selectivity  of  the  population  sampled, 
while  the  difficulty  of  finding  tests  equally  approp¬ 
riate  at  all  ages,  yet  similar  in  content,  would  appear 
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to  obstruct  the  collection  of  data  to  substantiate  the 
hypothesis.  On  the  basis  of  Ferguson’s  theory  (1954), 
which  accounts  for  the  stability  of  ability  factors 
in  terms  of  over-learning,  the  differentiation  hypo¬ 
thesis  certainly  appears  plausible.  Whatever  the  fate 
of  these  theories,  the  frequently  observed  reduction 
in  g  variance  with  age  suggests  that  its  greatest  use¬ 
fulness  as  a  construct  is  to  be  found  in  the  testing  of 
heterogeneous  groups  of  children. 

Behavioural  Referents  of  Intelligence .  For  the 
man  in  the  street,  intelligence  certainly  has  meaning 
over  and  above  the  statistical  interpretation.  Intel¬ 
ligent  behaviour  is  recognized,  for  instance,  in  situa¬ 
tions  demanding  sound  judgment,  abstract  thinking, 
speedy  adaptation  or  the  sizing  up  of  a  situation.  It 
is  difficult,  however,  to  use  any  one  or  combination 
of  these  concepts  for  validating  or  improving  intelli¬ 
gence  tests  because  their  behavioural  referents  are, 
too,  in  doubt.  Pursuance  of  this  course  usually  ends 
in  irrelevant  semantic  problems,  which  hinder  rather 
than  help  our  refinement  of  the  concept. 

It  could  well  be  argued  that  behavioural  refer¬ 
ents  are  unnecessary  to  the  concept  of  intelligence. 

As  long  as  it  serves  a  purpose  in  human  prediction 
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situations  it  can  be  described  merely  as  the  major 
source  of  common  mental  test  variance,  at  least  until 
its  neurological  correlates  are  discovered.  However, 
administrators,  clinicians,  research  workers,  and  test 
constructors  working  with  intelligence  tests  in  new 
situations  need  some  indication  of  what  the  tests  are 
measuring  in  order  to  know  when  and  where  they  may  be 
most  useful.  A  teacher  who  uses  a  standardized  arith¬ 
metic  test  to  group  his  class  for  better  teaching  of 
arithmetic  believes  in  advance  that  the  test  will  help 
him  do  this  because  he  knows,  generally,  what  the  test 
is  measuring.  To  suggest  to  the  administrator  that  he 
uses  intelligence  tests  to  group  his  classes  for  a 
variety  of  subjects,  however,  provokes  the  question 
,TWhat  will  these  tests  measure?” 

In  the  light  of  our  statistical  position,  a 
reasonable,  if  cautious,  answer  would  appear  to  be  nthe 
common  denominator  of  many  complex  intellectual  abilities 
plus  a  few  of  the  more  common  abilities  themselves.” 

Since  intelligence  tests  vary  considerably  in  the  weight 
they  assign  to  the  various  group  factors,  some  tests  may 
be  more  useful  for  the  purpose  of  the  school  adminis¬ 
trator,  some  for  the  counsellor,  some  for  the  clinician. 
Where  the  criterion  is  known  to  emphasize  verbal  and 
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numerical  skills,  such  as  school  learning,  and  treat¬ 
ment  is  relatively  fixed  rather  than  adaptive,  conven¬ 
tional  verbal  tests  of  intelligence  will  classify  and 
predict  skilfully.  However,  if  the  criterion  is  vague 
or  far-distant,  and  if  the  possibility  of  change  in 
treatment  is  great,  the  best  predictors  should  be  those 
which  give  better  estimates  of  an  individual’s  general 
ability,  apart  from  the  more  specific  skills  which  the 
tests  estimate. 

Thus,  to  assist  the  administrator  in  his  under¬ 
standing  of  the  behaviour  referents  of  intelligence 
tests,  we  must  first  discover  his  purposes  in  testing, 
then  choose  the  instrument  which  in  the  light  of  empi¬ 
rical  evidence  best  performs  this  function,  and  describe 
the  content  in  terms  of  factor  analysis  and  content 
inspection.  In  other  words,  intelligence  tests  measure 
along  several  dimensions  of  behaviour  and  necessarily 
include  non-intellectual  factors.  With  this  in  mind, 
there  can  be  described  no  one  set  of  behavioural  refer¬ 
ents  for  intelligence.  The  construct  is  necessarily 
vague  because  there  are  so  many  different  kinds  of 
intellectual  tasks  in  which  an  individual  can  display 
more  or  less  intelligent  behaviour. 

The  General  Factor.  If  it  were  possible  to  dis¬ 
cover  an  instrument  which  could  measure  intelligence 
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with  as  much  reliability  and  objectivity  as  a  barometer 
measures  atmospheric  pressure,  it  would,  on  the  basis 
of  our  present  evidence,  measure  only  a  general  factor. 
For  insofar  as  it  measured  group  factors,  it  would  be 
adding  a  dimension  which  was  uncorrelated  with  a  number 
of  other  intellectual  skills.  Cattell  (1943)  has 
proposed  that  a  measure  of  g  saturation  be  our  sole 
criterion  for  validating  intelligence  tests.  This 
suggestion  requires  the  assumption  that  intelligence  is 
a  unitary  factor,  an  assumption  which,  if  incorporated, 
would  lower  a  test’s  predictive  validity  for  criteria 
which  stress  group  factors  such  as  school  or  college 
grades.  Tests  designed  according  to  this  standard 
would  not  conform  to  our  definition  of  intelligence; 
nevertheless,  the  proposal  has  merit  if  a  measure  of 
broad  intellectual  ability  apart  from  specific  learned 
abilities  is  desired.  Since  this  is  the  kind  of  con¬ 
struct  required  for  the  present  investigation,  evidence 
on  the  nature  and  usefulness  of  the  general  factor  must 
now  be  considered. 

Spearman  and  numerous  investigators  who  followed 
him  have  demonstrated  that,  whatever  processes  the 
general  factor  in  ability  test  batteries  represents, 
they  must  be  complex.  Tests  with  high  g  loadings  con¬ 
sistently  require  subjects  to  see  and  use  difficult 
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relations,  to  abstract  and  synthesize,  to  manipulate 
symbols  of  various  kinds.  By  contrast,  tests  such  as 
spelling,  counting,  knowledge  in  specific  fields,  and 
simple  sensory  processes  regularly  show  low  g  satura¬ 
tions.  Therefore,  we  can  assume  that  the  general 
factor  reflects  the  operation  of  complex  mental  activ¬ 
ities,  as  opposed  to  simple  sensory  processes. 

A  more  important  characteristic  of  g  is  its 
relative  immunity  to  environmental  influences.  This 
was  demonstrated  by  Woodrow  (193$)  when  he  showed  that 
practice  in  each  of  seven  tests  over  a  period  of  thirty- 
nine  days  affected  the  group  factors  found  in  a  thirty- 
three  test  battery,  but  left  the  g  component  relatively 
unchanged.  Cattell  reached  a  similar  conclusion  when 
he  showed  that  his  test  of  g  was  quite  insensitive  to 
cultural  change  over  a  test-retest  interval  of  thirteen 
years,  while  the  more  verbal  Stanford-Binet  showed  a 
significant  change  in  norms  over  a  comparable  period 
(Cattell,  195$). 

Another  instrument  which  has  frequently  been 
shown  to  measure  g  and  little  else  is  the  Progressive 
Matrices.  In  a  recent  study,  this  test  has  strength¬ 
ened  the  case  for  the  stability  of  g  by  showing 
significantly  lower  negative  correlations  with  family 
size  than  did  verbal  intelligence  and  achievement 
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tests.  The  investigator,  Nisbet  (1953),  found  prog¬ 
ressive  increases  in  correlations  between  verbal 
intelligence  and  family  size  as  the  age  of  the  children 
increased,  suggesting  that  the  variance  which  the  family 
size  accounted  for  was  influenced  by  environment. 
Therefore,  by  virtue  of  the  low  correlation  of  the 
Matrices  test  with  family  size,  it  seems  reasonable  to 
argue  that  whatever  processes  the  test  sampled  were  less 
affected  by  environmental  stimulation. 

Other  comparisons  of  verbal  and  non-verbal 
intelligence  tests  support  the  case  for  the  stability 
of  the  g  factor  irrespective  of  environmental  differ¬ 
ence.  Thus,  Gow  (1925)  showed  that  the  English  canal 
boat  children  whose  Stanford-Binet  intelligence  quo¬ 
tient  declined  to  sub-normality  with  age  (Gordon,  1923) 
were  not  subnormal  on  non-verbal  tests.  Similarly, 
Burnett Ts  (195$)  comparative  study  of  Grade  IX  samples 
of  children  from  St.  JohnTs,  Halifax,  and  Newfoundland 
outposts  showed  that  the  Raven  Matrices’  mean  did  not 
reflect  the  differences  in  cultural  opportunities  as 
much  as  the  Wechsler  Bellevue  Performance  intelligence 
quotients. 

Considered  collectively,  the  foregoing  findings 
suggest  that  the  g  factor,  unlike  the  group  factors  of 
verbal  and  numerical  abilities,  is  relatively  insensitive 
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to  environmental  influence.  If  this  generalization 
can  be  accepted,  g  is  obviously  an  important  and  use¬ 
ful  concept  in  the  study  of  human  ability. 

To  summarize,  g  as  a  statistical  construct  can 
be  well  estimated  in  relative  isolation  from  group 
factors  (Raven,  1954;  Cattell,  195$).  If  the  appro¬ 
priate  analytic  techniques  are  employed,  g  appears 
invariably  whenever  a  broad  sample  of  mental  tests  is 
administered  to  a  heterogeneous  group.  It  is  exemp¬ 
lified  in  behaviour  of  a  complex  rather  than  a  simple 
kind,  is  relatively  stable  and  independent  of  differ¬ 
ences  or  changes  of  environment,  and  is  of  considerable 
value  in  predicting  to  non-specific  criteria.  (Vernon 
and  Parry,  1950;  Burke,  195$).  Many  test  constructors, 
including  Terman,  Wechsler,  Cattell,  and  the  majority 
of  British  and  French  psychologists,  have  been  suffi¬ 
ciently  impressed  with  the  usefulness  of  the  concept 
of  g  to  use  it  as  a  criterion  for  intelligence  test 
design.  The  foregoing  considerations  suggest,  too, 
that  if  tests  of  g  could  be  purified  sufficiently,  they 
would  be  much  less  subject  to  the  criticisms  of  intel¬ 
ligence  quotient  inconstancy  commonly  advanced  against 
the  conventional  verbal  intelligence  test.  For  much  of 
this  inconstancy  could,  no  doubt,  be  attributed  to  the 
differing  weights  assigned  to  group  factors  and  to  a 
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greater  sensitivity  to  environmental  change. 

In  view  of  these  characteristics  g  has  been 
selected  as  a  major  criterion  for  the  validity  of 
the  intelligence  tests  used  in  this  investigation. 


. 
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CHAPTER  IV 


SOCIO-ECONOMIC  STATUS 

Diversity  of  Approaches .  Various  indices  of 
social  class  and  socio-economic  status  have  been  popu¬ 
larized  by  sociologists  in  recent  years.  Barber  (1957) 
has  classified  them  as: 

1.  How  people  evaluate  one  another. 

2.  How  people  associate  with  one  another. 

3.  The  material  possessions  and  symbolic 
activities  people  display. 

Since  agreement  is  lacking  on  definitions  and 
criteria  of  social  class  and  socio-economic  status,  it 
is  necessary  to  review  some  of  the  relevant  research 
to  estimate  the  value  of  the  different  kinds  of  indices 
for  the  purpose  of  this  investigation. 

W.  Lloyd  Warner  (1949),  a  pioneer  in  the  field 
of  social  class  measurement,  has  used  all  three  app¬ 
roaches  extensively  in  his  studies  of  social  strati¬ 
fication  in  New  England,  the  Deep  South,  and  the 
Middle  West.  Teams  of  field  workers  interviewed 
large  numbers  of  citizens  to  obtain  objective  and 
subjective  estimates  of  social  standing  of  themselves 
and  others  and  came  up  with  considerable  agreement  in 


\ . , ' 

.  -  - 


•  ll  .  . 

r‘:  ■.  s.oc  .  ,  .’.rt;  -’1 

:  r:,  . 

„  ■  :  l  - 

\  ,/  •  ;  ;•  •,(.  ■  .  V J:  </  '  ■  ■  "i  £•  i-' 


r  ,  .  ‘0  0<  :  ''  -  ’  ;  ■  ■■ 

£  I 


'  d  ■ 

\  ] 

d  .  .. 

! -• 

r  'L';  y  " 

6J 

.  '  ’  . 

< )  Cj 

J.  C  .  t 

. 

j  t ' 

' \ 

.  o  i 

'.;.o 

'  ■  : 

d'r 

:  1  1 

;r .  .  .  .r.‘i 

.  • 

. . 

i'j 

. 

55 


their  classifications.  Correlations  among  indices  of 
this  "Evaluated  Participation"  (E.P.)  procedure  were 
high,  but  the  criteria  were  not  clearly  difined,  the 
research  and  time  involved  was  tremendous,  and  the 
methods  were  limited  to  use  in  communities  small  enough 
for  people  to  know  one  another  well.  Consequently,  in 
his  Jonesville  study,  Warner  explored  the  possibility 
of  substituting  a  more  easily  obtained  and  objective 
measure  for  the  elaborate,  time-consuming  indices  used 
in  previous  investigations.  Using  a  weighted  four  item 
index  based  on  occupation,  source  of  income,  house  type, 
and  residential  location,  he  discovered  that  the  results 
obtained  from  this  more  economical  "Index  of  Status 
Characteristics"  (I.S.C.)  correlated  .97  with  the  com¬ 
plex  E.P.  procedure.  Moreover,  the  correlation  ob¬ 
tained  with  occupation  alone  was  .91;  with  source  of 
income  0&5;  with  house  type  .$5;  and  with  residential 
area  0$2. 

Occupational  Criteria.  Kahl  and  Davis  (1955), 
like  Warner,  have  well  demonstrated  the  usefulness  of 
an  occupational  scale  in  social  stratification  measure¬ 
ment  by  factor  analyzing  a  large  number  of  indices 
commonly  used  for  this  purpose.  They  confirmed  Warner* s 
conclusion  that  the  correlation  between  most  indices  is 
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high,  and  showed  that  the  variable  most  highly  loaded 
with  the  common  factor  they  appear  to  measure  is 
occupation* 

To  improve  the  rating  of  occupations,  Hatt  (1953) 
has  compiled  an  objective  occupational  scale  by  a 
national  standardization  of  ninety  occupations  ranked  by 
representatives  of  all  classes  and  groups  across  the 
United  States  in  terms  of  "social  standing*"  Inkeles 
and  Rossi  (1956)  have  found  correlations  greater  than 
•35  between  this  scale  and  similar  occupational  scales 
standardized  by  various  methods  in  such  industrialized 
countries  as  Great  Britain,  Germany,  Russia,  New  Zealand 
and  Japan*  Consequently,  there  are  several  useful 
criteria  with  which  to  compare  newly  developed  occupa¬ 
tional  scales. 

An  objective  scale  suitable  for  use  in  Canada 
has  been  compiled  by  Blishen  (1953)  on  the  basis  of 
national  census  data.  For  each  of  343  occupations, 
the  mean  income  and  number  of  years  schooling  required 
were  calculated;  the  standard  scores  on  the  two  vari¬ 
ables  were  combined  and  averaged,  resulting  in  a 
scale  ranging  from  thirty- two  to  ninety  with  a  mean 
of  approximately  fifty  and  a  standard  deviation  of 
ten.  According  to  Blishen,  this  scale  appears  to  give 
predominant  weight  to  the  amount  of  responsibility 
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involved  and  the  degree  of  training  required,  two 
factors  which  strongly  influence  subjective  rankings 
of  occupations. 

Empirical  justification  for  the  use  of  such  a 
scale  is  seen  in  the  correlation  of  .94  between 
BlishenTs  Scale  and  Hatt’s  "National  Opinions  Research 
Centre  Index."  Moreover,  Blishen  found  a  mean  corre¬ 
lation  of  .$5  between  his  Canadian  index  and  similar 
scales  standardized  in  Great  Britain,  New  Zealand, 
Japan,  and  Germany  by  a  variety  of  methods  (Blishen, 
195$).  In  the  light  of  these  findings,  Blishen’ s 
scale  was  selected  as  a  useful  instrument  to  obtain 
an  estimate  of  socio-economic  status  in  the  present 
study. 


The  Home  Index  Scale .  Some  investigators 
(for  example,  Gough,  1949)  have  insisted  that  occupa¬ 
tion  is  only  one  aspect  of  social  prestige,  and  have 
suggested  that  a  variety  of  approaches  to  the  problem 
is  likely  to  lead  to  more  meaningful  results.  In  his 
appraisal  of  the  various  methods  used  to  measure  social 
status,  Gough  maintained  that  the  Chapin  Living  Room 
Scale,  based  on  interviewers’  ratings  of  material 
possessions,  is  probably  the  most  objective,  but 
rather  time-consuming  and  expensive  for  large  samples . 
Using  the  questionnaire  approach,  Gough  produced  a  new 


f  fjj  '  -  X.  ■  :  -|  •  tH  '-V..V  VJ;i 

■ 

.  . 

' 

. 

..  ?  ‘  ■  .  -  ? 

■  ' 

. 

r  .  .  ■  •  '  " 

5 

,  '  ■'  ;  ' 

. 

:  .  ■  ■  ’ l.  ;  .  •  r; .  ~  • 

- 


„ 

! 

:  t 

c 

«  • 

. 

...  .  .  .  . 

V 

. 

» J  1  1  . 

c 

f  ,  • 

- 

"Home  Index"  scale,  suitable  for  educational  research 
with  large  numbers,  easy  to  administer  and  simple  to 
score.  In  his  own  words,  it  was  .  .  ."based  largely 
upon  a  re-analysis  and  re-working  of  items  in  the  Sims 
Score  Card  and  the  American  Home  Scale,  with  the  addi¬ 
tion  of  certain  original  items."  (1949,  p.  53).  The 
scale  consisted  of  twenty-one  items,  to  be  answered 
"yes"  or  "no;"  it  showed  a  re-test  reliability  of  .  9$9 
on  college  students,  and  a  Kuder-Richardson  co-effi¬ 
cient  of  ,74  on  high  school  students.  According  to 
the  Flesch  readability  standards  (Flesch,  194$)  it  was 
well  within  the  scope  of  the  average  junior  high  school 
child.  More  important,  it  correlated  »$$  and  .$2  with 
the  two  scales  on  which  it  was  based,  and  065  with  a 
measure  of  occupational  status. 

Consequently,  it  was  decided  to  adapt  Gough’s 
scale  for  use  in  the  present  investigation,  in  order 
to  supplement  the  occupational  index.  Then  by  com¬ 
bining  the  two  scores  with  equal  weighting,  a  useful 
and  sensitive  measure  of  socio-economic  status  was 
produced.  No  claim  is  made  that  such  a  procedure  will 
isolate  the  crucial  environmental  variables  operating 
to  accelerate  mental  maturity.  A  venture  of  this  kind 
is  beyond  the  scope  of  this  study.  It  is  merely  hoped 
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to  produce  a  scale  sensitive  enough  to  indicate  when 
a  number  of  these  stimulating  correlates  are  present 
and  operating,  directly  or  indirectly,  and  when  their 
importance  is  reduced.  The  evidence  cited  in  pre¬ 
vious  chapters  suggests  that  the  two  scales  described 
should  meet  these  requirements  adequately. 
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CHAPTER  V 


DEFINITIONS,  POSTULATES 
AND  HYPOTHESES 

The  theoretical  considerations  covered  in  the 
previous  four  chapters  now  require  formalization  into 
a  set  of  definitions,  postulates,  and  testable  hypo¬ 
theses  . 


Definitions . 

lo  For  the  purpose  of  this  study,  intelligence 
is  defined  statistically  as  a  hierarchy  of  factors 
derived  from  an  analysis  of  mental  test  results.  At 
the  top  of  the  hierarchy  is  a  general  intellectual 
factor,  which  accounts  for  a  large  portion  of  the 
variance,  and  is  sampled  by  all  mental  ability  tests. 

At  the  next  level,  group  factors,  such  as  verbal, 
numerical,  spatial,  and  mechanical  abilities  contri¬ 
bute  considerably  to  the  test  variance.  The  appearance 
of  these  and  lower-order  factors  depends  upon  the  test 
battery  and  population  sampled.  Mon-intellective 
factors  must  necessarily  be  included  in  a  statistical 
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2.  Socio-economic  status  is  defined  operation¬ 
ally  as  the  dimension  measured  by  the  Blishen  occupa¬ 
tional  scale  and  an  adaptation  of  the  Gough  Home  Index, 
in  equal  weights. 

3.  The  concept  of  "culture  loading"  refers  to 
the  extent  to  which  tests  are  dependent  upon  symbolism 
which  is  likely  to  be  more  familiar  to  some  testees 
than  others. 

4.  A  "culture-reduced"  test  is  one  which  holds 
verbal  symbolism  and  past  learning  to  a  minimum,  and 
uses  a  medium  which  is  relatively  unfamiliar  to  all 
examinees . 

Postulates . 

1.  Individual  differences  in  measured  intelli¬ 
gence  are  due  to  an  interaction  of  the  following  vari¬ 
ables  in  different  proportions. 

a.  Hereditary  predisposition:  some  quality  of 
the  genes  and  nervous  system  as  determined  at  the  moment 
of  conception. 

b.  Environment :  those  forces  which  impinge 
upon  the  organism  after  conception. 

c.  Content  of  test  items :  for  example,  verbal, 
numerical,  pictorial,  perceptual,  and  other  kinds  of 
items  show  different  discrimination  patterns  between 
large  segments  of  the  population. 
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d.  Other  systematic  factors :  such  as  test 
motivation,  test  sophistication,  and  function  fluc¬ 
tuation. 

e.  Chance  errors :  of  measurement. 

2.  Children  from  high  socio-economic  levels 
perform  significantly  better  on  conventional  intelli¬ 
gence  tests  than  children  from  low  status  levels. 

This  indicates  the  systematic  operation  of  some  or  all 
of  the  variables  (a)  to  (d)  under  (1)<> 

3.  Tests  and  items  which  depend  upon  verbal 
symbolism  and  acquired  information  differentiate  more 
clearly  between  socio-economic  levels  than  do  non¬ 
verbal  reasoning  testso 

4.  Intelligent  behaviour  can  be  displayed  in  a 
wide  variety  of  testing  situations. 

3.  The  general  factor  of  intellectual  ability 
can  be  adequately  sampled  through  restricted  and  un¬ 
familiar  media,  such  as  non-verbal,  perceptual  items, 
provided  they  call  forth  sufficiently  complex  mental 
processes.  Tests  of  this  nature  will  minimize  the 
importance  of  1  ( c ) . 

6.  The  ability  or  abilities  which  correspond  to 
the  general  factor  are  less  sensitive  to  changes  in 
environmental  stimulation  than  some  group  factor  abili¬ 
ties.  Therefore,  tests  which  sample  the  general  factor 
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only  will  reduce  the  emphasis  on  1  (b). 

7.  Tests  which  sample  group  factor  abilities  show 
more  irregular  patterns  of  item  discrimination  between 
socio-economic  levels  than  tests  of  the  general  factor. 

$.  Culture-reduced  tests  which  show  high  g  load¬ 
ings  and  insignificant  ability  or  environmental  loadings 
will  predict  as  efficiently  as  culture-loaded  tests  of 
intelligence,  when  non-specific  and  distant  criteria 
are  used. 

9.  By  minimizing  the  contribution  of  environ¬ 
mental  advantage  and  content  familiarity  to  test  score 
variance,  measures  which  sample  only  the  general  intel¬ 
lectual  ability  factor  will  maximize  the  importance  of 
hereditary  influences  in  determining  test  scores. 

10.  A  "Culture-reduced"  intelligence  test  should 
be  evaluated  by  the  following  criteria: 

a.  It  should  largely  sample  a  broad  general 
factor  of  intellectual  ability. 

b.  It  should  show  negligible  loadings  on  verbal, 
numerical,  and  other  group  factors. 

c.  It  should  show  lower  correlations  with  socio¬ 
economic  status  than  do  conventional  verbal  intelligence 
tests . 

d.  It  should  show  relatively  little  decline  with 


age  for  children  from  unfavoured  socio-economic  backgrounds, 
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and  correspondingly  little  increase  for  favoured  groups. 

e.  It  should  consist  of  items  which  can  be 
solved  in  any  language  or  mode  of  expression,  and  which 
are  likely  to  be  as  familiar  and  useful  for  one  group 
of  society  as  any  other. 

f.  It  should  show  a  regular  pattern  of  item 
discrimination  across  all  socio-economic  levels. 

g.  It  should  show  a  considerable  relationship 
with  school  achievement. 

Hypotheses .  The  principle  purpose  of  this  inves¬ 
tigation  could  be  described  as  exploratory.  To  identify 
tests  which  meet  certain  predetermined  criteria,  how¬ 
ever,  it  is  necessary  to  have  in  mind  some  hypotheses 
about  a  number  of  promising  testing  instruments.  The 
following  hypotheses  were  made  about  the  tests  adminis¬ 
tered  at  the  Grade  III,  VI,  and  VII  levels  to  the  sample 
of  Edmonton  children  included  in  this  study. 

I.  Certain  selected  culture-reduced  tests  have 
high  loadings  on  the  general  intellectual  ability 
factor  common  to  the  majority  of  ability  and  achievement 
tests  used  in  the  investigation.  The  specified  tests 
will  show  negligible  loadings  on  narrower  group  factors. 
To  test  this  hypothesis  two  predictions  were  made. 

a.  At  the  Grade  VII  level,  the  following  tests 
will  show  high  g  loadings  and  insignificant  loadings  on 
group  factor's. 
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i.  Raven’s  Standard  Progressive  Matrices 

(1956). 

ii.  Cattell’s  Test  of  g  -  Culture-Free,  Scale  2. 

iii.  The  three  sub-tests  and  the  total  score  of 
the  Lorge-Thorndike  Non-Verbal  Intelligence  Test,  Level  4. 

iv.  Sub-tests  7  and  $  of  the  Holzinger-Crowder 
Uni-factor  Battery. 

v.  The  Non-Language  Total  of  the  California 
Test  of  Mental  Maturity  (Elementary). 

b.  The  remaining  twelve  tests  and  sub-tests 
administered  in  Grades  VI  and  VII  will  show  significant 
loadings  on  one  or  more  ability  or  environment  factors 
in  addition  to  the  general  factor. 

2.  Culture-loaded  tests  differentiate  between 
socio-economic  levels  more  than  do  the  selected  culture- 
reduced  tests.  This  hypothesis  can  be  tested  through 
the  following  deduction: 

Socio-economic  status  of  father  will  show  greater 
correlations  with  scores  on  the  culture-loaded  tests 
than  it  does  with  scores  on  the  culture-reduced  tests 
at  each  grade  level  tested. 

3.  The  environmental  effects  of  socio-economic 
variables  are  such  that  over  a  period  of  four  years 
(1956  to  I960)  children  from  upper  social  levels  will 
significantly  improve  their  standing  on  a  culture-loaded 


c  -  ■ 


’ 

' 


/  ■  -  -  . 

.7  -■  -■■■<• 

-  . .  ■  1  -  * 

. 

. 


>  . • 


,/  ■  •  .  O  ■  .  .  .  :  .1  •  :  •  •  f:J:  .v  V  .V  j ■ , .b 


"■  - 

„ 

•  -  -- 

■  ... :  ■ ;  .  .  -  -  :  .  ~  •  •• 

: 

■  ■.  '  D  -  :  :  >.7 

...  (  .  '  C.  .  ’  .  : 

_  i  ■" 

.  .  .  ' 

. 

:  . 

1  i'  !  ■  v  f  V  I . 


*  intelligence  test,  but  not  on  a  culture-reduced  test. 

Thus  it  can  be  predicted  that: 

a.  Socio-economic  status  of  father  will  show 
the  same  correlations  with  Raven  Matrices  scores  at 
Grades  III  and  VII. 

b.  Socio-economic  status  of  father  will  show 
greater  correlations  between  California  Mental  Maturity 
scores  at  Grade  VII  than  at  Grade  III. 

c.  Socio-economic  status  of  father  will  show 
greater  correlations  with  change  in  California  Mental 
Maturity  T  score  than  with  change  in  Raven  T  score 
between  Grades  III  and  VII. 

4.  Culture-reduced  tests  will  show  a  more  regu¬ 
lar  pattern  of  item  discrimination  between  socio¬ 
economic  groups  than  culture-loaded  tests.  This 
hypothesis  leads  to  the  following  deduction: 

When  the  Grade  VII  sample  is  categorized  into 
three  groups  on  the  basis  of  socio-economic  status,  an 
item  analysis  of  the  most  promising  culture-reduced 
tests  will  show  that  upper  and  lower  thirds  succeed  in 
similar  ratios  on  each  item.  On  the  California  Mental 
Maturity  Verbal  Test,  discrimination  indices  will  vary 
significantly  more  from  item  to  item. 

The  testing  of  these  seven  deductions  constitutes 
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the  core  of  the  analysis  of  this  dissertation. 
Additional  problems  have  been  investigated,  however, 
in  an  attempt  to  throw  more  light  on  the  construct 
validity  of  the  most  promising  culture-reduced  tests. 
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CHAPTER  VI 


EXPERIMENTAL  DESIGN  AND  PROCEDURE 

The  investigation  could  be  briefly  described 
as  both  longitudinal  and  cross-sectional  in  its  approach. 
Ability  and  achievement  tests  were  examined  in  relation 
to  socio-economic  status  at  Grades  III,  VI,  and  VII  by 
means  of  correlations,  factor  analyses,  comparison  of 
means,  item  analyses,  and  change  w ith  age.  The  same 
children  were  tested  three  times,  twice  in  Edmonton¬ 
wide  public  school  surveys  conducted  by  the  Faculty  of 
Education  of  the  University  of  Alberta,  and  thirdly, 
by  the  present  investigator  in  a  study  restricted  to 
three  schools  at  the  Grade  VII  level. 

The  Population  Sampled.  The  children  tested 
were  all  pupils  of  public  schools  in  the  city  of  Edmon¬ 
ton  and  to  this  extent  were  not  representative  of  the 
general  population.  Numerous  studies  (Anastasi,  195$) 
have  shown  discrepancies  between  city  and  rural  chil¬ 
dren  in  mental  ability,  and  local  research  (Reid  and 
Conquest,  1955)  indicates  that  Alberta  children  are 
typical  in  this  respect.  Nevertheless,  if  the  pupils 
examined  in  the  present  investigation  are  representative 
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of  Edmonton  children  as  a  whole  it  is  reasonable  to 
argue  that  significant  findings  may  well  be  genera¬ 
lized  to  other  Canadian  cities  of  comparable  size  and 
composition.  Some  information  on  the  locale  of  the 
study  should  therefore  be  outlined. 

Edmonton  is  a  rapidly  expanding  central  Alberta 
community  of  nearly  300,000  people.  The  surrounding 
region  is  an  important  farming  and  cattle-raising  dis¬ 
trict,  but  its  main  source  of  prosperity  is  oil. 
Incorporated  as  a  town  in  1$92  and  as  a  city  in  1904, 
Edmonton  has  seen  very  rapid  growth  in  recent  years,  a 
growth  which  is  reflected  in  the  unusual  mobility  of 
citizens,  particularly  in  the  outlying  suburbs.  How¬ 
ever,  residential  stability  is  apparent  in  several 
central  and  suburban  areas.  In  one  school  on  the  near 
south  side  of  the  North  Saskatchewan  River,  for  example, 
over  eight}'  per  cent  of  the  children  tested  in  the 
Grade  VII  study  were  present  for  the  Grade  III  study, 
four  years  earlier. 

Occupationally,  all  levels  of  the  Blishen  Scale 
are  well  represented  in  Edmonton  with  the  exception  of 
farm  and  rural  workers.  The  mean  occupational  T  score 
of  a  random  sample  of  over  four  hundred  Grade  VI  pupils 
was  only  1.63  higher  than  Blishen’ s  mean  of  30,  while 
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the  sample  standard  deviation  was  9-35,  merely  0.65 
points  less  than  the  expected  population  value.  ihere- 
fore,  Edmonton  citizens  appear  to  be  relatively  typical 
Canadians  in  respect  to  occupation. 

Socially,  there  is  little  sign  of  Warner’s 
Upper-Upper  class  in  Edmonton,  since  the  comparative 
youth  and  rapidly  acquired  prosperity  of  the  city  makes 
the  possibility  of  a  long-established  wealthy  set  un¬ 
likely.  Expensive  homes  and  exclusive  residential  dis¬ 
tricts  do  exist,  but  their  occupants  are  more  often 
members  of  the  ’’nouveau  riche”  and  hence,  of  Warner’s 
Lower-Upper  Class.  Consequently,  comparisons  with 
American  studies  using  Warner’s  system  of  social  strat¬ 
ification  must  be  made  with  this  difference  in  mind. 
Other  social  levels  on  Warner’s  Index  are  well  repre¬ 
sented,  but  may  not  conform  precisely  to  the  scales 
used  in  the  present  investigation. 

The  Sample  Tested.  After  discussion  with  school 
board  authorities,  three  large  schools  were  selected 
for  the  Grade  VII  testing  program.  Since  it  was  desired 
to  choose  a  sample  typical  of  the  Edmonton  Grade  VII 
population,  careful  consideration  was  given  to  such 
factors  as  academic  averages,  socio-economic  represen¬ 
tation,  residential  mobility,  geographic  location,  and 
school  type. 
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Of  the  three  schools  finally  selected,  one  was  an 
average-sized  south  side  Elementary  Junior  High  School, 
which  provided  sixty-six  Grade  711  pupils  mostly  from 
a  stable  residential  area  of  high  socio-economic  status. 
The  second,  also  an  Elementary  Junior  High  School, 
added  ninety-nine  more  pupils  who  represented  a  wide 
range  of  cultural  levels.  Population  figures  (Census, 
1956)  in  the  census  tract  surrounding  this  school 
indicate  a  greater  than  one  hundred  per  cent  increase 
in  the  five  years  between  1951  and  1956,  a  character¬ 
istic  which  is  typical  of  many  outlying  Edmonton  sub¬ 
urbs.  The  third  school,  a  Junior  High  School  in  the 
northern  outskirts ,  provided  one  hundred  six  Grade  VII 
pupils,  who  clustered  mainly  around  the  lower  socio¬ 
economic  strata. 

Each  school  differed  considerably  in  location, 
size,  administration,  average  ability  level,  and  socio¬ 
economic  characteristics  of  their  populations.  The 
extent  to  which  they  were  representative  of  the  Edmonton 
Grade  VII  public  school  population  was  examined  by  com¬ 
paring  the  selected  sample  of  two  hundred  seventy-one 
pupils  with  a  randomly  chosen  group  of  four  hundred 
thirty-two  pupils,  drawn  from  the  Edmonton-wide  Grade 
VI  survey  administered  in  the  previous  year0  The  two 
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groups  were  compared  on  the  basis  of  age,  intelligence 
quotient,  occupational  level  and  sex,  and  the  results 
reported  in  Chapter  VII.  No  significant  differences 
were  observed. 

Other  relevant  data  showed  that  two  hundred  forty- 
four  of  the  two  hundred  seventy-one  pupils  in  the  sample 
were  Canadian  born.  Of  the  remainder,  nine  were  English, 
eight  German,  seven  Dutch,  and  three  miscellaneous. 

Forty  pupils  reported  a  language  other  than  English 
spoken  in  the  home,  but  only  nineteen  of  these  used  it 
more  than  occasionally. 

Test  Instruments  Used .  Previous  research  on 
socio-economic  differences  in  intelligence  suggested 
seven  criteria  (Postulate  10)  which  should  be  approached 
by  tests  which  are  to  show  the  greatest  promise  as  cul¬ 
ture-reduced  instruments.  Consequently,  the  matter  of 
test  selection  resolved  itself  into  the  problem  of 
finding  reliable  group  tests  which  are  measures  of 
general  intellectual  ability,  which  are  largely  non¬ 
verbal  and  non-numerical ,  which  show  some  indication 
of  being  relatively  free  from  cultural  bias,  and  which 
appear  to  have  practical  utility  in  terms  of  economy, 
ease  of  administration,  and  group  use.  Several  more 
tests  were  selected  to  compare  with  the  culture-reduced 
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tests,  and  to  ensure  a  wide  sampling  of  abilities  in 
the  main  factor  analysis.  After  a  review  of  test  man¬ 
uals  and  relevant  literature,  the  following  instruments 
were  chosen: 

1.  Raven1  s  Standard  Progressive  Matrices  (195.6) 
was  selected  as  a  most  promising  culture-reduced  intel¬ 
ligence  test  for  several  reasons.  It  is  non-verbal, 
unspeeded,  intrinsically  interesting,  easily  adminis¬ 
tered,  and  in  one  reviewer’s  opinion,  as  good  a  measure 
of  pure  g  through  one  non-educational  modality  as  has 
appeared  on  the  market  (Burke,  195$).  Raven,  Vernon, 
Emmett,  Nisbet,  and  Banks  have  all  reported  g  loadings 
over  .79,  while  American  investigators  who  did  not 
allow  for  a  general  factor  gave  the  test  high  loadings 
on  an  Inductive  or  Reasoning  factor  (for  example, 
Rimoldi,  1946). 

For  one  hundred  twenty  thirteen-year-old  children 
Raven  reported  a  re-test  reliability  figure  of  .66. 

Burks  and  Sinha,  as  reported  in  Burke  (1956),  found  an 
average  reliability  figure  of  .66  when  they  used  split 
half,  Kuder-Richardson  and  another  analysis  of  variance 
technique . 

The  test  is  composed  of  sixty  items  classified 
into  five  sets  of  twelve,  each  set  being  relatively 
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homogeneous  in  the  nature  of  the  principles  required 
in  solution.  The  first  problem  of  each  set  is  easy 
enough  to  be  self-explanatory,  and  the  progressive 
increase  in  difficulty  within  each  set  makes  the  in¬ 
strument  a  tester  of  what  it  teaches. 

Each  item  consists  of  a  matrix  or  pattern  of 
nine  figures,  one  of  which  is  omitted.  The  task  of 
the  examinee  is  to  choose  from  the  six  or  eight  fig¬ 
ures  set  out  below,  the  one  which  completes  a  pattern, 
which  must  be  educed  from  the  remaining  eight.  The 
principles  differ  slightly  from  set  to  set,  but  most 
factor  analyses  of  the  test  have  shown  little  besides 
g,  with  occasional  small  loadings  on  a  spatial  factor 
(Vernon,  1950). 

Further  justification  for  the  selection  of  the 
Raven  test  can  be  seen  in  cross-cultural  research  by 
Rimoldi  (194&)  and  Tuddenham  (195$),  who  showed  that 
Argentine  and  United  States  children  yielded  similar 
norms  to  RavenTs  English  standardization  samples. 
Moreover,  the  coloured  version  of  the  Matrices  has 
already  shown  promise  in  the  Grade  III  Edmonton  Survey 
(MacArthur,  I960),  in  which  it  showed  a  high  loading 
on  the  general  factor  only.  The  opportunity  for  longi' 
tudinal  comparison  made  Raven’s  test  a  logical  choice 
for  this  study. 
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2,  I . P . A . T .  Cattell  Test  of  g,  "Culture-Free , M 
Scale  two  was  chosen  because  it  was  originally  designed 
as  a  culture-free  test  of  the  g  factor.  The  evidence 
offered  in  Chapter  II  suggests  that  it  may  be  a  good 
non-verbal  measure  of  g  well  worthy  of  further  study. 
The  scale  used  in  this  investigation  consists  of  four 
sub-tests,  containing  a  total  of  forty-six  items  of  a 
perceptual  nature.  It  is  economical,  easy  to  admin¬ 
ister  and  can  be  given  in  less  than  twenty  minutes. 
Cattell  (1958)  reported  immediate  re-test  reliability 
figures  of  .85  on  four  hundred  fifty  British  secondary 
school  entrants,  and  split  half  coefficients  of  .70, 
.86,  .87,  and  .92  on  four  different  samples. 

The  first  test,  called  Series,  contains  twelve 
items.  Three  figures,  changing  progressively  according 
to  one  or  more  rules,  are  followed  by  a  blank  fourth. 
The  examinee  is  required  to  select  the  fourth  in  the 
series  from  a  group  of  five  figures.  The  classifica¬ 
tion  test  consists  of  fourteen  items,  each  containing 
five  different  geometrical  patterns.  The  subject  is 
required  to  generalize  in  order  to  find  a  common  prin¬ 
ciple,  and  then  select  the  odd  figure  on  the  basis  of 
the  generalization  derived.  The  third  test  is  a  matrix 
type  similar  to  Raven’s,  and  the  fourth  consists  of 
eight  items  where  the  examinee  is  called  to  select  one 


-  ' 

-  .  .  ■  ■ 

■ 


. 

:  .  ;•  '  J 

. 

. 

. 

. 

• 

■  •  : 

c  •  • 

. .  ' 

'  •  >  JT'  .!  ' 


-■  •  .  ’  i 

. 


figure  from  five  in  which  certain  conditions  pertain 
similar  to  that  of  the  model. 

Compared  with  Raven’s  Matrices,  the  Cattell  has 
the  merit  of  examining  through  four  rather  than  one 
problem  type,  but  has  the  compensating  disadvantage  of 
requiring  more  instructions  to  solve  the  first  items. 

3.  The  Lorge- Thorndike  Non-Verbal  Intelligence 
Test ,  Level  4,  (1957)  is  a  more  recent  American  test, 
similar  to  the  previous  two  in  that  it  is  non-verbal, 
easy  to  administer  within  a  class  period  and  primarily 
a  test  of  power  rather  than  speed.  For  the  standardi¬ 
zation  sample,  the  split-half  reliability  was  .93,  and 
the  alternate  forms  reliability  coefficient vas  „78. 
Correlations  with  other  verbal  intelligence  tests  were 
as  high  as  .7  to  .$,  yet  inspection  of  the  test  items 
suggests  very  little  dependence  on  past  learnings. 

The  three  sub-tests  require  more  verbal  instruc¬ 
tions  than  the  Raven  test,  and  the  first  items,  being 
more  difficult,  are  not  self-explanatory.  Nevertheless, 
the  items  could  be  solved  in  a  variety  of  languages  or 
modes  of  expression,  making  previous  acquisitions  less 
important.  The  Figure  Classification  sub-test  is  a 
series  type,  similar  to  the  first  in  the  Cattell  test 
of  g.  Then  follows  a  Number  Series  sub-test  in  which 
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a  pattern  has  to  be  discerned  in  the  numerals  before 
the  series  can  be  completed.  Obviously,  facility  with 
numbers  would  improve  an  individual’s  score  on  these 
items,  a  fact  which  counts  against  its  possible  use  for 
culture-reduced  intelligence  measures.  The  third  sub¬ 
test,  called  Figure  Analogies,  presents  the  examinee 
with  the  problem  of  observing  a  relationship  between 
two  diagrammatic  figures  and  selecting  another  figure 
from  five  to  complete  a  pair  with  an  analogous  rela¬ 
tionship  to  the  first  two. 

The  test  has  face  validity  in  that  it  appears 
to  be  sampling  the  ability  to  see  relationships  and 
manipulate  abstract  concepts,  characteristics  which 
the  authors  deem  important  in  their  description  of 
intelligent  behaviour.  As  Freeman  points  out  (1955), 
the  test  constructors  have  drawn  on  the  concepts  of 
Spearman  and  Terman  in  developing  their  construct  of 
intelligence.  Evidence  on  the  g  loadings  and  predic¬ 
tive  validity  of  the  tests  is  so  far  lacking,  but 
several  reviewers  regard  it  as  one  of  the  best  set  of 
intelligence  tests  on  the  market  today  (Buros,  195$). 

4.  Holzinger-Crowder  Uni-Factor  Test  (1952), 
a  recent  test  of  the  multi-aptitude  variety,  was  sel¬ 
ected  for  the  present  study  because  it  could  provide 
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two  promising  non-verbal  reasoning  tests  which  might  be 
useful  culture-reduced  measures  of  g,  and  a  spatial  test 
to  add  to  the  battery  for  factor  analytic  purposes. 
According  to  the  manual  and  several  reviewers  (Buros, 
1959),  the  test  has  been  well  constructed  and  standar¬ 
dized,  reliabilities  are  high  and  intercorrelations 
between  factors  low.  Although  the  verbal  factor  showed 
better  predictive  validity  with  school  marks,  it  was  not 
used  because  several  other  verbal  tests  were  available. 
The  reasoning  tests  showed  split-half  reliability 
coefficients  of  .93  on  Grade  VII  children  and  median 
correlations  about  .5  with  school  achievement.  To  this 
extent  they  appeared  promising,  particularly  since  two 
of  the  three  tests  are  non-verbal.  Of  these  two,  the 
Figure  Changes  test  appears  to  be  less  dependent  on 
past  learning  as  it  uses  only  perceptual  items  requiring 
ability  to  see  and  use  a  relationship.  The  Mixed  Series 
test  which  consists  of  number-letter  series  completions, 
may  favour  children  with  greater  facility  in  arithmetic 
and  knowledge  of  the  alphabet.  The  first  spatial  test 
required  the  subject  to  decide  whether  two  boots  were 
viewed  from  the  same  or  different  sides.  Since  the 
other  sub-tests  were  duplicated  elsewhere  in  the  battery 
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Vernon  (1959)  pointed  out  that  the  verbal  and 
reasoning  sub-tests  are  the  only  two  useful  for 
academic  prediction,  and  inferred  that  they  are 
measuring  the  g  and  v:ed  factors  which  are  stressed 
by  global  intelligence  tests. 

5.  California  Test  of  Mental  Maturity  (CTMM), 
Short-Form,  Elementary  (1957)  was  included  for  two 
reasons.  It  is  a  popular  and  well-reviewed  intelli¬ 
gence  test  of  the  culture-loaded  variety  (Eells,  1951) 
and  it  was  used  in  the  Grade  III  survey  along  with 
the  Raven  Coloured  Progressive  Matrices.  Consequently, 
it  was  useful  for  making  comparisons  with  age  and 
socio-economic  status. 

According  to  Burt  (1959  a),  it  is  one  of  the 
best  sets  of  group  tests  available.  It  furnishes 
separate  factor  scores  for  spatial,  logical,  numerical, 
and  verbal  abilities,  a  language,  a  non-language,  and 
a  total  intelligence  quotient.  Correlations  of  the 
intelligence  quotients  with  other  verbal  intelligence 
tests  are  frequently  in  the  region  of  .8  and  split-half 
reliability  coefficients  for  sub-tests  range  from  .87 
to  .93.  However,  the  factor  scores  have  been  criti¬ 
cized,  since  they  were  developed  from  "logical"  rather 
than  factorial  constructs  (Buros,  1959).  The  correla¬ 
tions  between  factors  range  from  .3  to  .6,  which 
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indicates  the  probability  of  a  large  general  factor; 
moreover,  no  evidence  is  shown  of  the  usefulness  of 
the  sub-tests  in  educational  guidance.  It  was  hoped 
that  further  evidence  of  their  meaning  could  be  ob¬ 
tained  from  this  study. 

The  whole  test  requires  an  hour  of  administra¬ 
tion  time,  but  is  relatively  easy  to  give,  and  the 
"scoreze"  answer  sheets  make  marking  quick  and  accurate. 
The  spatial  factor  is  made  up  of  two  tests,  one  consis¬ 
ting  of  pictures  of  separate  hands  or  feet  in  various 
positions,  which  are  to  be  labelled  as  left  or  right. 

The  second  is  similar  to  the  Thurstone  space  tests, 
in  that  a  figure  has  to  be  recognized  in  rotated  form 
amongst  a  group  of  five.  The  first  logical  sub-test 
is  a  classification  test  requiring  the  pupil  to  select 
a  picture  which  conforms  to  the  principle  common  to  a 
group  of  familiar  objects.  The  next  test  consists  of 
a  list  of  logical  syllogisms  in  verbal  form.  A  number 
series  and  an  arithmetic  reasoning  test  make  up  the 
numerical  factor,  and  a  vocabulary  multiple-choice 
test  yields  a  verbal  concepts  factor. 

6.  Laycock  Mental  Ability  Test  (1933)  is  an 
older  test,  used  extensively  in  Western  Canada  for  the 
routine  testing  of  school  pupils.  It  was  administered 
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to  the  children  in  the  present  investigation  in  Grade  V, 
and  was  used  in  the  correlation  and  factor  analytic 
studies  for  comparison  with  the  culture-reduced  tests. 

The  Laycock  test  was  ’’constructed  on  the  basis 
of  Spearman’s  theory  of  the  nature  of  intelligence” 
and  ’’aims  to  measure  Spearman’s  general  factor  g.” 
Consequently,  most  of  the  tests  are  those  involving 
the  eduction  of  relations  and  correlates  sampled 
through  verbal,  numerical,  and  abstract  symbols. 

Since  the  vocabulary  level  is  high  and  over  seventy- 
six  per  cent  of  the  one  hundred  sixty-eight  items  are 
verbal,  it  was  predicted  that  the  test  would  sample 
more  than  g,  and  that  it  would  show  some  degree  of 
cultural  loading. 

Concurrent  validity  is  shown  by  correlations 
about  .  S  with  other  popular  intelligence  tests  of  a 
global  kind,  and  median  coefficients  of  .6  for  average 
school  achievement.  Reliability  figures  quoted  in  the 
manual  are  high,  varying  from  .SO  (re-test)  to  .95  (split 
halves).  The  sub-tests  are  named  Classification,  Anal¬ 
ogies,  drawing  Analogies,  Opposites,  Inferences,  Number 
Series,  and  Completion.  Results  are  expressed  in  terms  of 
Total  Intelligence  Quotient  only. 
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7.  California  Achievement  Battery  Junior  High 
(1957)  was  not  administered  in  the  I960  program,  since 
it  was  given  in  the  Grade  III  and  VI  surveys  and  was 
intended  only  to  help  identify  factors  in  the  main  fac¬ 
tor  analytic  study.  The  test  provides  nine  scores: 
Namely:  heading:  Comprehension,  Vocabulary,  and  Total. 

Arithmetic :  Reasoning,  Fundamentals,  and  Total. 

Language :  Mechanics,  Spelling,  and  Total. 

The  battery  is  well  standardized  across  forty-eight  of 
the  United  States,  Kuder-Ri chard son  reliability  coef¬ 
ficients  for  the  sub-tests  range  from  .83  to  .95,  and 
correlations  with  similar  batteries  are  encouraging 
enough  to  indicate  good  construct  validity  ( Tiegs  and 
Clark,  1957). 

8.  Coloured  Progressive  Matrices  (1947)  was 
used  in  the  Grade  III  survey,  and  by  virtue  of  its 
similarity  to  the  1956  edition  used  in  Grade  VII,  pro¬ 
vides  an  excellent  means  for  comparing  correlations 
with  socio-economic  status  at  different  age  levels. 

The  coloured  version  consists  of  three  sets;  A  and  B 
are  identical  to  those  in  the  main  test,  except  for 
the  addition  of  bright  colours  in  the  Matrices;  the 
third  set  has  been  added  to  make  for  greater  dis¬ 
crimination  in  young  children.  Re-test  reliability 
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coefficients  of  ,8  are  reported  with  nine-year-olds 
(Burke,  1958),  while  Martin  and  Wiechers  showed  a  corre 
lation  of  .9-1  with  WISC  intelligence  quotient  (1954)  in 
a  group  of  one  hundred  Grade  III  students. 

9.  California  Test  of  Mental  Maturity  Primary 
(1953)  was  another  major  instrument  used  in  the  Grade 
III  survey.  It  is  very  similar  to  the  Elementary 
edition  in  that  it  presents  the  pupil  with  the  same 
problems  under  the  same  sub-test  labels  but  at  a  dif¬ 
ferent  level  of  difficulty,.  More  importance  seems  to 
be  attached  to  listening  for  instructions  in  the  Pri¬ 
mary  edition  than  in  the  Elementary,  which  depends  more 
upon  reading.  At  both  levels  there  are  a  large  number 
of  items  which  sample  abilities  through  a  medium  of 
past  learnings;  for  example,  vocabulary,  knowledge  of 
animals,  occupations,  and  so  on. 

10.  Occupational  Scale  is  an  objective  scale 
compiled  from  Canadian  census  data  on  the  basis  of 
average  income  and  years  of  training.  (See  Appendix  B) 
As  reported  in  Chapter  V,  it  showed  a  correlation  of 
•94  with  an  American  scale  constructed  from  represen¬ 
tative  public  poll  on  the  basis  of  fTsocial  standing.” 
Information  on  parental  occupation  was  collected  from 
the  Grade  VII  students  by  means  of  a  questionnaire 
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(See  Appendix  A)  and  checked  against  Henderson’s 
Directory  (I960).  This  directory  comprises  a  complete 
list  of  all  Edmonton  citizens,  with  information  on 
name,  address,  occupation,  and  occupation  of  wife,  col¬ 
lected  from  a  house-to-house  survey  in  March  of  each 
year.  Classification  of  occupations  was  facilitated 
by  a  complete  index  of  Canadian  occupations  published 
by  the  Bureau  of  Statistics  (1951). 

11 o  Home  Index  Questionnaire  was  adapted  from 
a  questionnaire  devised  by  Gough  (1949).  The  original 
scale  showed  re-test  reliability  of  .99  and  a  Kuder- 
Richardson  coefficient  of  .74.  Correlations  with  other 
socio-economic  scales  varied  from  .65  to  .66,  and  pre¬ 
vious  research  with  the  scale  showed  that  it  was  sensi¬ 
tive  to  differences  in  verbal  intelligence  and  achieve¬ 
ment 

A  pilot  study  was  necessary  in  order  to  select 
the  most  suitable  items  for  inclusion  in  the  question¬ 
naire  for  the  present  study.  From  the  original  scale, 
several  items  were  excluded  as  inappropriate  for  Edmon¬ 
ton  conditions  (for  example,  possession  of  a  heating 
system).  More  likely  items  were  included,  bringing  the 
trial  scale  to  twenty-five.  This  was  administered  to  a 
Grade  VI  class,  an  item  analysis  performed  on  the  re¬ 
sults  and  the  five  poorest  discriminators  discarded 0 

Personal  communication  from  Dr. C .M. Christensen. 
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The  final  scale  (See  Appendix  A)  contained  fourteen 
items  from  the  original  scale  and  six  new  ones. 

Further  questions  on  home  language  were  asked  in 
the  questionnaire  for  the  purposes  of  a  parallel  study 
not  reported  here. 

Summary.  A  list  of  variables  used  in  the  main 
factor  analysis,  with  hypothesized  factors,  possible 
score  and  estimated  reliability  coefficients  is  set 
out  in  Table  I,  page  S6. 

The  Testing  Programs .  To  put  the  study  into  per¬ 
spective,  the  chronological  order  and  details  of  test 
administration  will  now  be  described. 

In  May,  1956,  more  than  3,500  Grade  III  children, 
all  those  attending  public  schools  in  Edmonton  at  the 
time,  were  given  the  following  tests: 

1.  California  Short  Form  Test  of  Mental  Maturity 
(Primary,  1953). 

2.  Raven’s  Coloured  Progressive  Matrices  (1947). 

3o  California  Achievement  Battery  (Primary, 

195D. 

4.  Gates  Reading  Tests. 

Tests  were  administered  by  classroom  teachers 
specially  prepared  for  the  task,  and  answer  sheets  were 
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.TABLE  I 


VARIABLES  USED  IN  THE  MAIN  FACTOR  ANALYSIS,  WITH 
HYPOTHESIZED  FACTORS,  POSSIBLE  SCORES, 

AND  RELIABILITY  COEFFICIENTS 


Test 

Hypoth. 

Factors 

Poss. 

Score 

Reliability 

Coefficient- 

1. 

Socio-Economic  Status.  Occupa- 

tion 

g>y>w 

90 

2. 

Home  Index 

g,v,w 

20 

3. 

Raven  Progressive  Matrices 

g 

60 

.88* 

4. 

Cattell  Test  of  g 

g 

46 

.85* 

5. 

Lorge-Thorndike  Fig.  Class 

g 

25 

.71** 

6. 

No.  Series 

gjn 

28 

.84" 

7. 

Fig.  Anal. 

P* 

o 

30 

.87** 

8. 

California  T.M.M. Numerical 

g,n, ed. 

25 

.90“ 

9. 

Verbal 

g,v 

50 

.93“ 

10. 

Spatial 

g»k 

35 

.87** 

11. 

Logical 

g,v 

30 

.87“ 

12. 

Laycock  Mental  Ability 

g,v,n 

168 

C)^AA” 

13. 

Holzinger-Crowder  Spatial 

g,k 

70 

- 

14. 

Mixed  Series 

g)n 

40 

- 

15. 

Fig.  Changes 

g.k 

40 

- 

16. 

California  Achiev.  Vocab. 

g, v , ed. 

90 

^ ^qAAA 

17. 

Comprehension 

g, v, ed. 

55 

^o2AAA 

18. 

Arith.  Reas. 

g, v,n,ed. 

55 

AAA 

19. 

Arith.  Fund. 

g,n, ed. 

80 

# ^^AAA 

20. 

Lang.  Mechanics 

g,v,ed. 

65 

. 92aaa 

21. 

Spelling 

g, v, ed. 

30 

#  $3AAA 

A  Re-test  reliability  coefficient . 

Ah  Split-half  reliability  coefficient. 

AAA  Kuder-Richardson  reliability  coefficient. 
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scored  by  university  research  students. 

Three  years  later,  in  May  1959,  the  same  children, 
then  in  Grade  VI  were  re-tested  with  a  similar  battery 
appropriate  for  their  grade  level.  The  following  were 
administered  at  that  time. 

1.  California  Short  Form  Test  of  Mental  Maturity. 
(Elementary,  1957). 

2.  California  Achievement  Battery,  (Junior  High, 

1957 ) o 

Information  was  also  obtained  on  Laycock  intel¬ 
ligence  quotients,  parental  occupation,  school  atten¬ 
dance,  family  size,  and  birth  order.  Again,  tests  were 
administered  by  teachers,  although  university  personnel 
advised  the  examiners  and  supervised  the  operation. 

In  May,  I960,  a  sample  of  four  hundred  thirty 
children,  then  in  Grade  VII,  were  given  the  following 
tests : 

1.  California  Short  Form  Test  of  Mental  Maturity 
(Elementary,  1957). 

2.  Raven’s  Progressive  Matrices  (1956). 

3.  I.P.A.T.  Cattell  Test  of  g  -  Culture-Free, 
Scale  2. 

4.  Lorge-Thorndike  Non-Verbal  Intelligence  Test, 


Level  4. 
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5.  Holzinger-Crowder  Uni-Factor  Tests  -  Spatial 
(Number  3),  Reasoning  (Numbers  7  and  $). 

6.  Home  Index  and  Occupational  Questionnaire. 

The  whole  testing  program  was  accomplished  in 

five  weeks,  one  school  being  completed  before  the  next 
was  begun.  Tests  were  administered  in  regular  class 
periods,  one  test  per  day,  with  approximately  the  same 
schedule  for  each  of  the  fourteen  classes. 

Since  the  tests  used  in  the  Grade  VII  program 
constituted  the  chief  reason  and  justification  for  the 
study,  it  was  essential  to  ensure  that  test  administra¬ 
tion,  scoring,  and  tabulation  be  carried  out  with  as 
much  uniformity  as  possible.  ‘The  writer  visited  each 
school  prior  to  the  beginning  of  the  program,  described 
the  purposes  and  details  of  the  project  collectively 
and  individually  to  staff  members,  and  discussed  the 
test  materials  with  each  teacher  concerned.  Fortunately, 
the  majority  of  tests  were  untimed,  and  administrative 
problems  were  reduced  to  a  minimum.  Nevertheless,  when 
difficulties  were  anticipated,  as  in  the  completion  of 
the  socio-economic  questionnaire,  or  in  the  case  of 
teacher  incompetence  or  lack  of  confidence  with  timed 
sub-tests,  the  present  investigator  took  over  the  admin¬ 
istration. 
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Every  class,  visited  by  the  writer  before  test 
administration  began,  was  given  a  description  of  the 
project,  which  was  justified  in  terms  calculated  to 
establish  rapport  and  to  motivate  achievement.  Emphasis 
was  laid  on  the  novelty  and  puzzle-like  nature  of  the 
tests,  the  lack  of  school  problems,  the  opportunity  for 
all  to  show  their  talents,  and  the  various  research  and 
guidance  purposes  to  which  results  could  be  put.  No 
difficulties  were  experienced  with  rapport,  and  although 
some  of  the  questions  to  be  answered  in  the  socio¬ 
economic  questionnaire  could  be  regarded  as  confidential 
material,  no  one  complained  or  refused  to  answer. 

Special  care  was  taken  with  the  completion  of 
occupational  information,  since  previous  experience  had 
shown  that  children  frequently  do  not  describe  the 
nature  of  their  parent’s  occupation  specifically  enough 
to  make  classification  possible.  In  the  few  cases  still 
too  vague  to  be  reliable,  a  follow-up  interview  was 
necessary.  All  occupations  described  were  compared  with 
cumulative  record  data  and  the  local  city  directory 
(Henderson,  I960),  Only  one  pupil  appeared  to  have 
raised  his  father’s  status  (from  bookkeeper  to  accoun¬ 
tant),  while  eight  parents  could  not  be  traced  in  the 
directory. 
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In  the  five  weeks  of  testing,  each  of  the  six¬ 
teen  tests  and  sub-tests  was  administered  fourteen  times. 
Despite  the  precautions  taken,  eight  instances  of  faulty 
test  administration  were  reported.  Since  six  of  these 
were  attributable  to  two  testers,  their  forty-six  pupils 
were  omitted  from  the  study.  The  other  two  were  com¬ 
mitted  with  one  sub-test  only,  the  Holzinger-Crowder 
Spatial;  therefore,  the  classes  were  retained  and  corre¬ 
lations  computed  on  a  restricted  sample  of  two  hundred 
twenty-four  for  that  test.  Maladministration  was  re¬ 
ported  by  the  teachers  concerned,  and  was  reflected  in 
each  instance  by  abnormal  class  score  distribution. 

Analysis  of  Results.  All  Grade  VII  children  in 
the  three  schools  were  tested,  but  only  the  two  hundred 
seventy-one  pupils  for  whom  the  required  data  were 
available  were  included  in  the  main  sample.  This  group 
was  tested  for  representativeness  of  the  Edmonton 
Grade  VII  population  by  comparison  with  a  group  of 
four  hundred  thirty-two  pupils  chosen  randomly  from 
the  Grade  VI  city-wide  survey  data. 

Frequency  distributions  were  made  of  raw  scores 
on  all  variables  in  order  to  test  for  normality.  It  was 
decided  to  use  raw  scores  since  several  tests  and  sub¬ 
tests  did  not  yield  intelligence  quotients,  and  the 
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sample  was  quite  homogeneous  with  respect  to  age.  The 
three  variables  which  did  not  conform  to  the  require¬ 
ments  of  normality  were  converted  to  T  scores,  and  all 
variables  were  then  recorded  on  tape  in  preparation  for 
use  with  the  L.  G.  P.  30  computer. 

Three  socio-economic  variables  were  included  in 
these  data,  occupational  T  score,  home  index  raw  score, 
and  a  combination  of  these  two  measures  with  equal 
weights,  yielding  what  was  termed  a  socio-economic 
index.  The  first  two  were  included  in  order  to  enable 
a  socio-economic  factor  (w)  to  appear  in  the  factor  anal¬ 
ysis;  the  third  was  used  to  test  Hypothesis  II,  which  was 
concerned  with  the  relationship  between  socio-economic 
status  and  various  test  scores.  The  occupational  score 
was  obtained  by  converting  each  parental  occupation  to 
a  scaled  score  on  Blishen’s  index.  This  process  was 
made  considerably  easier  and  more  objective  by  the  use 
of  the  Census  Classification  Index  (Bureau  of  Statistics, 
1951),  which  lists  all  occupations  under  a  variety  of  dif¬ 
ferent  labels  and  classifies  them  according  to  the  census 
data  from  which  Blishen  derived  his  scale. 

An  initial  two-way  analysis  of  variance  was  per¬ 
formed  to  test  for  the  effects  of  sex  against  socio-economic 
status.  The  various  hypotheses  were  subsequently  tested 
by  a  variety  of  methods. 
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Hypothesis  I,  was  examined  by  performing  a  Prin¬ 
cipal  Components  factor  analysis  on  the  inter-correlation 
matrix  of  the  twenty-one  variables  listed  on  page  86. 

This  was  designed  to  show  which  tests  could  be  inter¬ 
preted  as  measures  of  the  general  factor  and  which  had 
loadings  on  group  factors. 

Hypothesis  II,  which  was  concerned  with  the 
relationship  between  socio-economic  status  and  test  scores, 
was  examined  by  comparing  correlations  of  verbal  and 
non-verbal  intelligence  tests  with  the  combined  socio¬ 
economic  status  index,  and  by  testing  for  significance 
of  differences  in  correlation  coefficients.  Each  score 
on  these  tests  was  then  normalized  and  converted  to  a 
T  score,  the  sample  was  divided  into  five  socio-economic 
classes  of  approximately  equal  size,  and  the  mean  and 
standard  deviation  calculated  for  each  class  on  each 
test.  The  obtained  means  were  then  plotted,  and  tested 
for  linearity  of  regression. 

In  order  to  test  Hypothesis  III,  the  Raven  and 
California  test  scores  for  Grades  III  and  VII  were  con¬ 
verted  to  T  scores,  and  correlations  with  socio-economic 
status  were  computed  for  each  variable.  The  hypothesis 
of  greater  increase  with  the  culture-loaded  test  was 
examined  by  testing  for  the  difference  between  correlation 
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coefficients  at  Grade  III  and  Grade  VII.  Changes  in 
T  score  were  also  calculated  for  each  variable  and 
correlated  with  socio-economic  status  in  another  ap¬ 
proach  to  the  same  problem. 

Hypothesis  IV  required  an  item  analysis  of  a 
culture-loaded  and  a  culture-reduced  test  in  order  to 
investigate  the  respective  patterns  of  item  discrimin¬ 
ation  between  socio-economic  levels.  Raven’s  Progres¬ 
sive  Matrices  and  the  California  verbal  sub-test  were 
selected  as  typical  representatives  of  each  type  of 
measure,  and  the  high  and  low  socio-economic  thirds  of 
the  sample  of  two  hundred  seventy-one  pupils  were  com¬ 
pared  for  each  item.  On  each  item  with  an  average 
difficulty  level  between  ten  per  cent  and  ninety  per 
cent,  the  percentage  of  higher  and  lower  pupils  passing 
was  converted  to  Davis’  discrimination  index,  a  z  trans¬ 
formation  of  Flanagan’s  biserial  correlation  coefficient. 
The  mean  and  variance  of  the  discrimination  indices  of 
each  item  on  both  tests  were  calculated,  and  t  and  F 
tests  used  to  examine  the  significance  of  the  differences 
between  the  values  obtained.  A  higher  mean  on  the  verbal 
test  would  indicate  greater  discrimination  between  socio¬ 
economic  levels,  while  a  large  variance  would  suggest 
that  some  items  are  far  more  difficult  for  lower  class 
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children  than  others „  A  test  which  would  be  equally 
suitable  for  all  socio-economic  levels  should  reveal 
a  low  item  discrimination  variance  between  upper  and 
lower  classes.  In  connection  with  this  hypothesis, 
items  with  very  large  discrimination  indices  were 
examined  in  order  to  throw  light  on  the  identifica¬ 
tion  of  items  unsuitable  for  lower  class  children. 

When  each  hypothesis  had  been  investigated, 
the  most  promising  tests  and  sub-tests  which  met  the 
criteria  set  up  for  good  culture-reduced  intelligence 
tests  were  examined  more  closely,  and  implications  drawn 
for  future  test  selection  and  construction  when  the 
influences  of  cultural  acquisitions  are  to  be  minimized. 
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CHAPTER  VII 


PRELIMINARY  RESULTS 

Test  for  Sampling  Bias .  In  order  to  test  the 
experimental  group  for  representativeness  of  the 
Edmonton  Public  School  population,  a  random  sample  of 
four  hundred  thirty-two  pupils  was  selected  from  the 
city-wide  Grade  VI  Survey  and  compared  with  the  two  hun¬ 
dred  seventy-one  Grade  VII  pupils  finally  chosen  for 
the  testing  of  the  major  hypotheses.  Comparisons  were 
made  on  the  basis  of  age,  sex,  Laycock  Intelligence 
Quotient,  and  occupational  status  of  father.  The  results 
of  these  calculations  are  set  out  in  Table  II  with  the 
Grade  VII  age  mean  reduced  by  twelve  months. 

Since  none  of  the  observed  differences  between 
sample  and  estimated  population  means  and  standard  devia¬ 
tions  is  significant,  the  hypothesis  of  no  difference  in 
age,  intelligence  quotient,  occupation,  and  sex  is  accep¬ 
ted.  Although  this  does  not  prove  that  no  sampling  bias 
exists,  it  enables  the  reader  to  generalize  the  results 
of  this  study  to  the  Edmonton  Grade  VII  population 
with  a  greater  degree  of  confidence. 
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MEANS  STANDARD  DEVIATIONS,  PERCENTAGES,  AND  SIGNIFICANCE  LEVELS 
OF  A  GRADE  VI  RANDOM  SAMPLE ,  COMPARED  WITH  A  SELECTED 
GRADE  VII  SAMPLE,  ON  THE  BASIS  OF  AGE,  INTELLIGENCE 

QUOTIENT,  OCCUPATION,  AND  SEX  _ 
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A  Reduced  by  twelve  months. 
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Test  for  Sex  Differences .  ihe  effects  of  sex 
on  test  scores  was  contrasted  with  the  influence  of 
socio-economic  status  in  an  attempt  to  assess  the  re¬ 
lative  importance  of  each  variable.  An  important 
assumption  of  the  study  was  that  socio-economic  factors 
contributed  a  significant  influence  on  test  score  var¬ 
iance.  If  sex  differences  could  be  shown,  however, 
then  separate  analyses  for  boys  and  girls  would  have 
been  considered. 

The  design  used  for  this  comparison  was  a  two- 
way  analysis  of  variance  with  unequal  frequencies  in 
each  cell,  as  described  by  Walker  and  Lev  (1953).  The 
model  is  an  approximate  one,  which  derives  a  mean 
square  for  error  from  all  scores  in  the  sample,  and 
mean  squares  for  main  effects  from  the  mean  value  of 
each  cell. 

Three  tests  deemed  to  be  representative  of  the 
battery  were  selected  for  this  part  of  the  study.  Those 
tested  were  Raven1 s  Standard  Progressive  Matrices,  the 
C.T.M.M.  Numerical  sub-test  and  the  California  Achieve¬ 
ment  Reading  Total.  The  results  for  each  analysis  by 
sex  and  occupation  of  father  (four  levels)  are  listed 


in  Tables  III  to  V. 
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TABLE  III 


TWO-WAY  ANALYSIS  OF  VARIANCE  BY  SEX 
AND  OCCUPATION  OF  RAW  SCORES 
ON  PROGRESSIVE  MATRICES 


(N-271) 

Source 

Sura  Sqs. 

d.f . 

Mean  Sq. 

F 

Signif . 

Total 

14.16 

7 

- 

- 

- 

Sex 

0.05 

1 

0.05 

0.06 

N.S. 

Occupation 

12.41 

3 

4  •  14 

3.34 

.05 

Interaction 

1.70 

3 

0.57 

0.05 

N.S. 

Error 

- 

263 

1.08 

- 

- 

-  • 
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TABLE  IV 

TWO-WAY  ANALYSIS  OF  VARIANCE  BY  SEX  AND 
OCCUPATION  OF  RAW  SCORES  ON  THE 
CALIFORNIA  ACHIEVEMENT 
READING  TOTAL 


(N=271) 

Source 

Sum  Sqs. 

d.f. 

Mean  Sq. 

F 

Signif . 

Total 

277.97 

7 

— 

— 

— 

Sex 

4.33 

1 

4.33 

0.41 

N.S. 

Occupation 

224.62 

3 

74.87 

7.01 

0.01 

Interaction 

49.02 

3 

16.34 

1.53 

N .  3  0 

Error 

- 

263 

10.68 

- 

- 

- 
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TABLE  V 

TWO-WAY  ANALYSIS  OF  VARIANCE  BY  SEX  AND 
OCCUPATION  OF  RAW  SCORES  ON 
THE  CTMM  NUMERICAL  TEST 


(N-271) 

Source 

Sum  Sqs . 

d.f  • 

Mean  Sq. 

F 

Signif 

Total 

8.43 

7 

- 

- 

- 

Sex 

0.02 

1 

0.02 

0o06 

N.S . 

Occupation 

7.66 

3 

2.55 

7.29 

0.01 

Interaction 

0.75 

3 

0.25 

0.71 

N.S. 

Error 

- 

263 

0.35 
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A  perusal  of  Tables  III  to  V  indicates  that  sex 
contributes  little  or  no  variance  to  test  scores,  while 
occupational  status  reveals  a  significant  effect  with 
each  variable.  The  assumption  of  the  importance  of 
socio-economic  status  in  determining  test  scores  appears 
justifiable. 

Another  possible  influence  on  test  scores  is  that 
of  schools.  However,  analysis  of  variance  could  not  be 
used  to  test  this  effect,  since  the  assumption  of  the 
equivalence  of  occupational  status  categories  within  each 
school  could  not  be  held.  For  instance,  the  children 
from  School  A  in  the  top  socio-economic  class  had  a  mean 
score  of  71*02  on  the  occupational  scale,  which  is  sig¬ 
nificantly  higher  than  the  corresponding  mean  of  61.38 
at  School  C.  Moreover,  if  it  were  possible  to  hold 
occupational  status  constant,  it  can  be  argued  that 
any  difference  in  abilities  which  did  appear  between 
schools  could  be  attributed  to  the  influence  of  socio¬ 
economic  variables  on  school  atmosphere  and  attitudes 
towards  learning.  Thus,  eighty  per  cent  of  the  children 
from  School  A  belonged  to  the  highest  occupational 
groups  compared  with  only  thirty  per  cent  of  School  C. 

It  is  reasonable  to  assume  that  the  prevailing  attitudes 
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towards  education  in  School  A  were  more  conducive  to 
classroom  learning  than  those  of  School  C.  Further¬ 
more,  any  differences  in  educational  climate,  whether 
due  to  socio-economic  influences  or  better  teaching, 
could  well  be  regarded  as  part  of  the  cultural  environ¬ 
ment  of  the  pupils.  Since  it  is  a  pre-requisite  of  a 
culture-reduced  test  that  the  influence  of  both  these 
environmental  influences  be  minimized,  their  separation 
seems  unimportant  in  the  identification  of  good  culture- 
reduced  tests.  For  the  interpretation  of  high  correla¬ 
tions  between  test  scores  and  socio-economic  status, 
however,  it  should  be  remembered  that  the  latter  variable 
may  well  include  the  influence  of  schools. 

The  Home  Index  Questionnaire.  Since  this  scale 
contained  six  original  items,  it  was  necessary  to  inves¬ 
tigate  its  effectiveness  for  the  purpose  of  estimating 
socio-economic  status.  Its  reliability  coefficient  was 
calculated  by  the  split-half  method,  and  corrected  by 
the  Spearman-Brown  formula,  yielding  a  coefficient  of 
.77  on  the  sample  of  four  hundred  thirty-two  pupils. 

This  compares  favourably  with  Gough’s  figure  of  .74 
for  the  original  scale. 

The  validity  of  a  measure  of  this  kind  is 


difficult  to  evaluate.  A  perusal  of  the  items  suggests 
that  it  has  good  face  validity  and  many  of  the  items 
have  well  survived  the  test  of  time.  (Gough,  1949). 
Using  the  construct  validity  approach  of  Cronbach  and 
Meehl  (1955),  the  writer'  predicted  a  high  correlation 
with  the  occupational  scale,  medium  correlations  with 
verbal  and  culturally  loaded  tests,  and  low,  but  posi¬ 
tive,  correlations  with  non-verbal  and  spatial  tests. 
That  all  these  predictions  were  confirmed  can  be  seen 
from  the  results  of  Table  VI,  on  page  104. 

Apparently,  the  Home  Index  Questionnaire  is  a 
slightly  more  sensitive  instrument  than  the  occupational 
scale.  The  correlation  between  the  two  socio-economic 
variables  is  similar  to  the  corresponding  value  of  .65 
obtained  by  Gough  (1949). 

To  throw  more  light  on  the  similarity  of  the 
pattern  of  relationships  between  the  two  socio¬ 
economic  scales  and  all  the  tests  used  in  the  Grade  VI 
and  VII  batteries,  a  Spearman  Rank  Order  Correlation 
Coefficient  was  computed  between  the  two  sets  of  corre¬ 
lation  coefficients  of  each  scale  with  the  twenty-nine 
other  variables.  The  value  of  rho  was  0S7.  a  figure 
high  enough  to  suggest  that  the  two  measures  of  socio¬ 
economic  status  are  tapping  highly  similar  correlates  of 
mental  ability. 
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TABLE  VI 

CORRELATIONS  OF  SCORES  ON  THE  HOME  INDEX 
QUESTIONNAIRE  AND  OCCUPATIONAL  SCALE 
WITH  SELECTED  VARIABLES 


(N-271) 

Test 

Home 

Index 

Occupation 

Occupation 

.61 

— 

C.T.M.M.  Language 

.40 

o33 

Laycock  M.A„T. 

.32 

.31 

Progressive  Matrices 

.23 

.16 

Cattell  Test  of  g 

.24 

.19 

Holz-Crowder  Spatial 

.17 

to 

0 

• 

i 

5 
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Summary.  A  preliminary  examination  of  the 
data  obtained  from  the  Grade  VII  testing  program  led 
to  the  following  conclusions: 

1.  The  selected  sample  of  two  hundred  seventy- 
one  children  did  not  differ  significantly  from  the 
estimated  Grade  VII  population  means  and  standard 
deviations  on  the  variables  of  age,  Laycock  Intelli¬ 
gence  Quotient,  occupation,  and  sex.  Therefore, 
generalization  of  the  results  of  this  study  to  the 
Edmonton  Gade  VII  population  can  be  made  with  confi¬ 
dence. 

2.  No  significant  sex  differences  were  re¬ 
vealed  in  a  two-way  analysis  of  variance  on  three  tests, 
thought  to  be  representative  of  the  battery:  namely; 
Progressive  Matrices,  CTMM  Number  Test,  and  California 
Achievement  Test  Reading  Total.  Occupational  differ¬ 
ences  were  significant  in  each  instance. 

3.  The  Home  Index  Questionnaire  showed  a  split- 
half  reliability  coefficient  of  .77  and  a  correlation 
of  .61  with  the  occupational  scale.  It  conformed  to 
expectation  in  correlations  with  other  variables  used 
in  this  study. 
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CHAPTER  VIII 


THE  FACTOR  ANALYSES 

The  Model.  Two  separate  analyses  were  performed 
in  order  to  test  the  first  hypothesis  concerning  the 
abilities  sampled  by  each  test.  At  first,  the  inter¬ 
correlations  among  twenty-one  tests  and  sub-tests  were 
analyzed  in  order  to  investigate  dimensions  sampled 
by  each  variable.  The  two  socio-economic  variables 
were  also  included  in  the  first  analysis.  Once  the 
principal  factors  of  each  test  and  sub-test  had  been 
identified  another  analysis  was  performed  on  total 
scores  only,  thus  permitting  the  evaluation  of  each 
test  as  a  whole.  Eleven  variables  were  included  in 
the  second  battery. 

The  model  used  for  factoring  was  Hotelling’s 
Principal  Components,  which  has  become  the  preferred 
method  since  the  advent  of  computers.  The  factors 
produced  by  such  an  analysis  correspond  geometrically  to 
the  principal  axes  of  an  ellipsoid  determined  by  the 
position  of  each  variable  in  relation  to  every  other. 
Each  successive  factor  removes  a  maximum  but  decreasing 
amount  of  the  communality  of  the  original  variables, 
and  each  axis  is  regarded  as  being  orthogonal  to  the 
others , 
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The  determination  of  these  axes  is  best  per¬ 
formed  by  the  methods  of  matrix  algebra.  The  first 
factor  coefficients  which  are  represented  by  the  co¬ 
ordinates  of  each  test  on  the  first  principal  axis 
are  selected  so  that  the  variance,  V]_,  accounted  for 
by  the  first  factor  is  at  a  maximum,  where 


V1  =  all2  *  a212  * 
2  ? 

The  symbols  a-Q  ,  &21  '  ‘ 


lnl 


.  .  .  ani  represent 
the  squared  loadings  of  the  ratios  of  each  test  on  the 
first  factore  By  a  process  of  estimating  the  ratios 
of  each  test  loading  on  the  first  factor,  iterating  to 
improve  the  estimates,  and  reducing  the  first  estimates 
to  a  point  where  the  sum  of  their  squares  equals  the 
largest  final  estimate  or  first  eigenvalue  of  the  orig¬ 
inal  correlation  matrix  R^,  the  loadings  a-Q,  &i2>  and 
so  on,  are  calculated.  A  residual  matrix  is  then  set 
up  and  the  process  repeated  until  the  total  communal ity 
has  been  analyzed. 

Since  the  computation  involved  in  the  Principal 
Components  analysis  is  prohibitive  when  the  original 
matrix  consists  of  more  than  nine  or  ten  variables,  a 
program  was  written  by  the  investigator  for  the  L.G.P. 


30  computer  in  order  to  cope  with  the  problem.  This 
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program  increased  not  only  the  speed  but  also  the 
accuracy  of  the  analysis,  by  permitting  re-iteration 
to  improve  conununality  estimates,  and  by  allowing  for 
a  rigorous  test  of  the  rank  of  the  matrix  in  each 
iteration. 

The  Sub-Test  Analysis 

(a)  Extraction  of  Factors.  In  the  testing 
of  Hypothesis  I,  the  twenty-one  variables  listed  in 
Table  VII  were  intercorrelated  on  the  L.0oP.  30  com¬ 
puter  by  the  Pearson  product-moment  method  and  an 
estimate  of  the  communalities  of  each  variable  was 
made  by  calculating  the  squared  multiple  correlation 
(S.M.C.)  of  each  test  with  the  n-1  other  variables  in 
the  battery.  According  to  recent  empirical  findings 
on  the  problem  of  communality  determination,  the  S.M.C. 
provides  the  best  estimate  of  the  lower  bound  of  the 
communality  (Kaiser,  I960,  Wrigley,  1959).  To  obtain 
the  S.M.C. Ts,  the  intercorrelation  matrix  R^  was 
inverted  with  unity  in  the  diagonal  cells.  The  recip¬ 
rocals  of  the  twenty-one  diagonal  elements  of  the  new 
matrix  R“^  were  each  subtracted  from  one  to  arrive  at 
the  S.M.C. ’s.  Upon  analysis,  these  estimates  were  found 
to  be  too  small  in  eighteen  of  the  twenty-one  variables, 
the  average  discrepancy  being  .041.  Iteration  was 
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performed  by  replacing  the  communality  with  a  value 
increased  (or  decreased)  by  one  and  a  half  times  the 
discrepancy,  resulting  in  an  extremely  fast  conver¬ 
gence.  The  mean  disparity  after  the  first  iteration 
was  only  .007,  so  the  factoring  process  was  termin¬ 
ated  at  this  point.  Since  the  same  process  led  to  a 
correspondingly  fast  convergence  in  other  analyses  in 
this  study,  the  practice  of  changing  the  estimated 
communality  by  one  and  a  half  times  the  discrepancy 
appears  to  have  merit. 

The  final  estimates  of  the  communalities  in  the 
21  by  21  matrix  are  included  in  the  original  inter¬ 
correlation  matrix  Rj_  in  Table  VII. 

Six  principal  components  were  extracted  from 
this  matrix  but  the  last  was  discarded  because  it 
failed  to  reach  the  significance  level  according  to 
the  respective  tests  of  McNemar,  Guilford  and  Lacy, 
and  Burt.  The  first  test  requires  factoring  to  con¬ 
tinue  until  6^  is  less  than  the  standard  error  of  zero 

1  X 

r  ( y^p  ) ,  where  6^  is  the  standard  deviation  of  the 
last  set  of  residuals  divided  by  1  -Mg,  and  M^p  is 
the  mean  communality  to  that  point.  The  Guilford 


and  Lacey  test  requires  the  product  of  the  two  high¬ 
est  loadings  on  a  significant  factor  to  exceed  the 
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CORRELATION  MATRIX  OF  TWENTY-pNE  VARIABLES 


(N  =  271)  (Decimal  Points  Omitted) 


Variable 

1 

2 

3 

4 

5 

6 

7 

8|  9 

10 

11 

12 

13 

14 

15. 

16 

17 

18 

19 

20  21 

1.  Occupational  Status 

(552)* 

2 .  Home  Index 

610 

(582) 

3.  Raven  Matrices 

165 

237 

(542) 

4.  Cattell  g 

189 

244 

637 

(629) 

5.  Lorge-Th.  Fig.  Class. 

113 

152 

439 

441 

(358) 

6.  Lorge-Th.  No.  Series 

181 

176 

457 

422 

387 

(494) 

7.  Lorge-Th.  Fig.  Anal. 

240 

233 

566 

555 

485 

403 

(556) 

8.  CTMM  Number 

276 

282 

538 

466 

318 

510 

448 

(51J) 

9.  CTMM  Verbal 

286 

383 

337 

352 

291 

340 

374 

42? (700) 

10.  CTMM  Spatial 

059 

060 

502. 

465 

373 

369 

471 

34?  187 

(437) 

11.  CTMM  Logical 

286 

239 

527 

493 

362 

383 

455 

51?  469 

382 

(483) 

12.  Lay cock 

309 

324 

507 

498 

435 

438 

526 

554  667 

308 

550 

(740) 

13.  Holz-Crowder  Spatial 

081 

168 

417 

393 

237 

247 

262 

226  136 

310 

284 

268 

(328) 

14.  Holz-Crowder  Series 

239 

302 

358 

387 

338 

532 

396 

47]  425 

342 

373 

451 

277 

(555) 

15.  Holz-Crowder  Fig.  Changes 

143 

237 

494 

457 

347 

388 

379 

425 353 

234 

372 

428 

356 

451 

(442) 

16.  Calif.  Rdg.  Vocab. 

265 

328 

301 

254 

225 

334 

333 

364  717 

145 

355 

643 

176 

427 

334 

(742) 

17.  Calif.  Rdg.  Comp. 

250 

303 

466 

387 

280 

405 

405 

505  694 

213 

459 

662 

161 

461 

427 

681 

(645) 

18.  Calif,  Arith.  Reas. 

177 

234 

349 

360 

308 

368 

365 

416  438 

287 

313 

489 

215 

377 

316 

467 

465 

(371) 

19.  Calif.  Arith.  Fund. 

118 

162 

396 

303 

335 

435 

362 

51C  421 

233 

317 

479 

130 

470 

358 

447 

586 

459 

(621) 

20.  Calif.  Language 

179 

268 

404 

341 

296 

363 

340 

405  580 

250 

372 

621 

224 

500 

405 

656 

653 

427 

532 

(647) 

21.  Calif.  Spelling 

132 

154 

172 

068 

105 

143 

207 

276  449 

026 

184 

441- 

-037 

193 

114 

492 

438 

330 

425 

450  (445) 

*Final  Estimate  of  the  Communality . 


no 


DUE  VARIABLES 


ted) 

9  10  11  12  nS  14  15  16.  17  18  19  20  21 


I) 

(700) 

187  (437) 

469  382  (483) 

667  308  550  (740) 

136  310  284  268  (328) 

425  342  373  451  277  (555) 

353  284  372  428  356  451  (442) 

717  145  355  643  176  427  334  (742) 

694  213  459  662  161  461  427  681  (645) 

438  287  313  489  215  377  316  467  465  (371) 

421  233  317  479  130  470  358  447  586  459  (621) 

580  250  372  621  224  500  405  656  653  427  532  (647) 

449  026  184  441-037  193  114  492  438  330  425  450  (445) 


Ill 


standard  error  of  zero  r,  while  the  criterion  of  Burt 
and  Banks  is  satisfied  when  half  of  the  loadings  of  a 
questionable  factor  exceeds  twice  the  standard  error 
of  a  zero  loading,  where 

S.E.  -  I  n 

zero  r  J N('n— s-1 ) 

(n  =  the  number  of  tests,  s  =  the  number  of  the 

factor,  and  N  -  the  number  of  testees).  (Thomson, 

1951). 

Each  of  these  tests  is  designed  to  assist  the 
experimenter  to  determine  the  rank  of  the  correlation 
matrix  analyzed,  because  the  number  of  dimensions  of 
the  common  factor  space  is  equal  to  the  rank  of  the 
matrix.  Since  neither  rank  nor  communalities  can  be 
estimated  exactly,  and  since  a  change  in  the  estimate 
of  one  affects  the  solution  of  the  other  (Wrigley, 
1959),  the  problem  of  knowing  when  to  stop  factoring 
cannot  be  solved  beyond  doubt.  In  the  present  21  by 
21  matrix,  the  three  significance  tests  agreed  that 
the  sixth  factor  should  be  rejected.  The  fifth  fac¬ 
tor  satisfied  the  required  criterion  in  the  first  two 
tests  and  although  it  failed  to  do  so  in  the  third,  it 
was  retained  for  rotation  and  possible  identification. 

In  relation  to  the  present  discussion,  an 
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interesting  point  was  demonstrated  when  a  second  analysis 
was  performed  on  the  same  correlation  matrix  with  unity 
in  the  diagonals.  This  procedure  is  based  on  the  assum¬ 
ption  that  each  test  is  perfectly  reliable.  In  these 
circumstances,  the  rank  of  the  matrix  is  equal  to  the 
number  of  variables  used  in  the  battery.  It  was  found 
that  the  first  three  factors  were  identical  to  that  of 
the  first  analysis,  except  in  the  absolute  magnitude  of 
the  loadings ,  while  the  fourth  and  fifth  factors  dif¬ 
fered  only  in  their  order  of  appearance.  This  finding 
accords  with  !tfrigleyTs  contention  that  for  matrices 
larger  than  twelve  by  twelve  the  communality  problem 
is  of  minor  importance  in  practice,  if  not  in  theory. 

(195&) 

The  unrotated  factor  matrix  with  adjusted 
S.M.C.’s  in  the  diagonals  is  reproduced  in  Table  VIII. 

At  the  foot  of  the  Table,  the  row  indicating  percentage 
of  common  variance  made  by  each  factor  shows  a  diminish¬ 
ing  proportion,  as  the  Hotelling  solution  requires. 

The  large  percentage  contributed  by  the  first  factor 
is  a  reflection  of  the  high  degree  of  relationship 
between  all  variables  in  the  battery.  In  fact,  only 
one  of  the  twenty-one  correlations  coefficients,  that 
between  spelling  and  spatial  ability,  was  a  negative 
value . 
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TABLE  VIII 

UNAOTATLD  FACTOR  .ATRIA  AND  COMMUNALITIES  FOR 
TVtLNTY-ONE  VARI ABLER . 


(N* 271) 


(Decimal  points  omitted) 


Obtnd.  Lstmtd, 
2 


Variable 

V 

Hq  iII0 

1V0 

vo 

h; 

Occupational  Status 

361 

139 

642 

085  -119 

583 

552 

Home  Index 

425 

161 

620 

090 

056 

602 

582 

Raven  matrices 

667 

-306  -036 

-031  -037 

542 

542 

Cattell  g 

650 

-421 

057 

-151 

019 

626 

629 

Lorge-Th.  Fig.  Class. 

520 

-273  -057 

-047  -066 

355 

358 

Lorge-Th.  No.  Series 

612 

-195  -071 

261  -Oil 

486 

494 

Lorge-Th.  Fig.  Anal. 

652 

-277 

026 

-130  -182 

553 

556 

CTRL  Number 

696 

-104 

013 

175  -192 

563 

571 

CTMM  Verbal 

721 

370 

037 

-185 

038 

694 

700 

CTMM  Spatial 

472 

-439  -090 

-056  -038 

428 

437 

CTMM  Logical 

642 

-163 

088 

-141  -097 

476 

483 

Lay cock 

819 

136  -009 

-202  -071 

735 

740 

Holz-Crowder  Spatial 

377 

-317 

048 

-059 

279 

326 

328 

Holz-Crowder  Series 

652 

-052 

001 

310 

174 

554 

555 

Holz-Crowder  Fig.  Changes 

590 

-184  -015 

077 

228 

440 

442 

Calif.  Rdg.  Vo cab. 

694 

470  -047 

-124 

164 

747 

742 

Calif.  Rdg.  Comp. 

764 

295  -097 

-018 

013 

681 

645 

Calif.  Arith.  Reas. 

594 

074  -084 

044  -036 

369 

371 

Calif.  Arith.  Fund. 

647 

146  -276 

301  -148 

629 

621 

Calif.  Language 

711 

277  -161 

006 

181 

641 

647 

Calif.  Spelling 

425 

452  -161 

-044  -173 

443 

445 

%  of  Common  Variance 


69.8%  14.3%  8.5%  4.1%  3.3%  100% 


%  of  Total  Variance 


38.1%  7.8%  4.6%  2.3%  1.8%  54.6% 
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The  two  right  hand  columns  of  Table  VIII  reveal 
the  closeness  of  the  communality  estimates  for  the 
last  iteration,  only  three  of  the  discrepancies  being 
greater  than  .009. 

(b)  Rotation.  As  the  principal  components 
structure  is  an  artifact  of  the  test  battery,  and 
hence  not  invariant,  it  is  necessary  to  rotate  the 
obtained  configuration  of  test  vectors  to  a  position 
which  could  be  regarded  as  psychologically  meaningful. 
Factor  analysts  differ  over  the  procedure  of  rotation, 
some  preferring  to  maintain  orthogonality  between 
dimensions,  others  leaning  to  an  oblique  structure. 

In  the  case  of  abilities,  the  oblique  rotation  often 
indicates  a  second-order  general  factor  which  could 
be  attained  more  directly  by  an  orthogonal  rotation 
designed  to  maximize  the  general  factor  while  keep¬ 
ing  the  remaining  group  factors  independent  of  one 
another.  Therefore,  orthogonal  rotation  procedures 
were  employed  in  this  study. 

The  criteria  of  rotation  used  in  the  first 
analysis  were  those  of  positive  manifold  (Thurstone, 
1947)  and  psychological  meaningfulness.  These  two 
criteria  are  almost  universally  accepted  since  in  the 
first  case  bi-polar  ability  factors  are  hard  to  justify, 
and  in  the  second  it  seems  axiomatic  to  state  that 


:  .  ,  ’■  -  .  -I.  • 

. 

. 

- 

■  '  :  '  .  Vj  ' 

'  '  c  ' 

■  :•  ‘  '  V  ‘  ■/  '  ‘  ‘  a  : >  1 .  J  ...  i 


:  <  .1  ‘  L  •  .  :;i  J'.  '  i  :  . 

•  '  .  .  ' 

ii.’  :  .  .  .  ■  ..*>.■ 

.  .  1  >’  •.  '  1  i.  •  ;  t 

<  .  i  .  <  I  :  ■  .  ...• 

:  ■  ■  .  :  ■  - 

i.i  •  i 

. 

(  c 

.  ■  .  .  i 


115 

mathematical  devices  must  always  serve  the  purpose  of 
psychological  usefulness.  The  other  criterion  used 
in  rotation,  that  of  maximizing  the  general  factor, 
has  wide  but  not  universal  support.  For  instance,  the 
concept  of  simple  structure  does  not  allow  for  a  gen¬ 
eral  factor  amongst  the  first  order  factors  and  is 
therefore  inappropriate  for  this  study. 

In  the  first  analysis,  the  three  criteria  of 
positive  manifold,  maximum  g  and  psychological  mean¬ 
ing  were  used  and  four  rotations  were  found  necessary 
to  attain  a  satisfactory  position.  Factor  I  was 
rotated  with  Factors  II,  III,  and  V  respectively  and 
Factors  II  and  III  were  paired  for  the  last  rotation. 
It  should  be  added  that  the  criterion  of  maximizing 
g  would  have  been  unnecessary  if  it  had  been  decided 
beforehand  to  put  the  rotated  axis  through  the  clus¬ 
ter  of  points  which  would  have  produced  the  greatest 
number  of  zero  loadings  and  no  negative  values.  In 
each  rotation,  the  new  axis  position  was  relatively 
clear  and  yet  a  large  general  factor  emerged. 

( c )  Interpretation  of  Factors ♦  Table  IX  shows 
the  factor  pattern  which  emerged  after  four  rotations. 
The  interpretation  of  the  factors  was  relatively 
straightforward  in  this  analysis. 
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Thus  Factor  I,  with  high  positive  loadings  on  nearly 
all  variables  and  over  fifty  per  cent  of  the  communal 
variance  is  obviously  the  general  factor,  or  some 
approximation  to  Spearman’s  g.  It  is  important  to 
note  that  the  three  highest  g  loadings  are  contrib¬ 
uted  by  three  of  the.  seven  culture-reduced  measures 
used  in  this  battery.  The  Raven  Matrices,  Cattell’s 
Test  of  g,  and  the  Figure-Analogies  sub-test  of  the 
Lorge-Thorndike  battery  all  conform  to  the  conditions 
predicted  in  Hypothesis  I,  namely;  high  g  loading  and 
no  high  group  factor  loading.  Three  more  sub-tests 
numbers  5,  10,  and  11  approach  these  criteria  but 
show  lower  g  loadings.  As  expected,  the  Laycock  g 
loading  is  high,  since  it  was  designed  according  to 
Spearman’s  theory,  while  the  socio-economic  variables 
and  spelling  are  understandably  low. 

The  second  factor  has  its  highest  loadings  on 
two  vocabulary  tests,  but  since  it  is  also  heavily 
weighted  in  the  arithmetic  tests,  it  appeared  to  be 
the  familiar  v:ed  factor,  which  recurs  so  often  in 
English  analyses  of  school  children  (Vernon,  1950). 

Factor  III  is  extremely  clear  as  it  has  two 
very  high  loadings  and  practically  nothing  else. 

This  could  be  called  socio-economic  status  after  the 
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ROTATED  FACTOR  SOLUTION  AND  COMMUNALITIES  FOR 
TWENTY-ONE  VARIABLES. 


(Loadings  below  .20  omitted) 

I3  II.-  iii1  ivT 

A 

h£ 

Occupational  Status 

.251 

- 

.683 

- 

- 

.583 

Home  Index 

.247 

.205 

.669 

- 

- 

.602 

Raven  Matrices 

.708 

- 

- 

- 

- 

.542 

Cattell  g 

.754 

- 

- 

- 

•222 

.626 

Lorge-Th.  Fig.  Class. 

.582 

- 

- 

- 

- 

.355 

Lorge-Th.  No.  Series 

.547 

— 

- 

.395 

- 

.486 

Lorge-Th.  Fig.  Anal. 

.738 

- 

- 

- 

- 

.553 

CTMM  Number 

.639 

o  201 

- 

.331 

- 

.563 

CTMM  Verbal 

.458 

.658 

- 

- 

- 

.694 

CTMM  Spatial 

,612 

- 

- 

- 

- 

.428 

CTMM  Logical 

,660 

- 

- 

- 

- 

.476 

Lay cock 

,676 

.509 

- 

- 

- 

,735 

Holz-Crowder  Spatial 

.397 

- 

- 

- 

.396 

.326 

Holz-Crowder  Series 

,457 

,214 

- 

,453 

,303 

.554 

Holz-Crowder  Fig.  Changes 

.498 

©  “ 

- 

.  211 

.369 

.440 

Calif.  Rdg.  Vocab. 

,335 

,743 

- 

- 

.264 

.747 

Calif.  Rdg.  Comp. 

,501 

.618 

- 

- 

- 

.681 

Calif.  Arith.  Reas. 

.460 

.341 

- 

- 

- 

,369 

Calif,  Arith.  Fund. 

.450 

.436 

- 

.442 

- 

,629 

Calif.  Language 

,417 

.589 

- 

- 

.299 

.641 

Calif.  Spelling 

,201 

,618 

*= 

,443 

%  of  Common  Variance 

51.18% 

25.23% 

9,11% 

7.80% 

6068% 

100% 

%  of  Total  Variance 


27 s 94%  1-3.78%  4.97%  4.26%  3.65%  54.60% 
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measurable  aspects  of  general  ability  and  educational 
correlates  have  been  removed.  Presumably,  such 
characteristics  as  wealth,  ambition,  social  desirabi¬ 
lity,  ostentation  would  be  characteristic  of  this 
factor. 

Four  numerical  tests  figure  prominently  in  the 
fourth  dimension.  These  are  predominantly  non-verbal 
tests  requiring  only  facility  with  the  manipulation  of 
numbers.  The  fact  that  the  Arithmetic  Reasoning  test 
is  so  inconspicuous  in  this  factor  might  be  attributed 
to  the  verbal  setting  of  the  problems,  which  swings 
much  of  the  variance  to  the  v:ed  factor. 

It  will  be  recalled  that  the  fifth  factor 
barely  reached  the  significance  level.  Understandably, 
then,  the  interpretation  of  this  factor  is  less  clear- 
cut.  A  slight  rotation,  designed  to  eliminate  the 
minus  loadings,  left  the  three  Holzinger-Crowder  tests 
and  the  California  English  test  as  the  most  prominent 
on  this  factor.  Since  these  tests  are  all  rather  long, 
and  the  first  three  highly  speeded,  the  dimension  is 
probably  one  of  speed.  However,  as  its  precise  inter¬ 
pretation  is  not  necessary,  no  definite  label  will  be 
attached  to  it. 

The  fact  that  no  spatial  factor  appeared  in  the 
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analysis  is  probably  due  to  the  low  ceiling  of  one  of 
the  two  spatial  sub-tests.  Only  the  last  four  items 
of  the  CTMM  Spatial  Test  caused  the  students  much  dif¬ 
ficulty,  and  close  inspection  of  these  items  indicates 
that  they  depend  more  upon  logical  reasoning  than 
spatial  manipulation.  Since  it  requires  at  least  two 
variables  to  determine  a  separate  dimension,  no  spatial 
factor  was  found. 

Analysis  of  Total  Tests .  The  eleven  total  tests 
were  factor  analyzed  by  the  same  procedures  as  those 
employed  in  the  foregoing  analysis.  The  correlation 
matrix  is  reproduced  in  Table  X  and  the  rotated  factor 
solution  in  Table  XI.  Only  one  iteration  was  necessary 
to  enable  the  communality  estimates  to  converge  on  the 
obtained  values. 

Three  significant  factors  were  sufficient  to 
account  for  the  common  variance,  and  all  three  were 
rotated  and  identified.  Once  again,  the  criteria  of 
positive  manifold  and  maximum  g  gave  psychologically 
meaningful  results,  this  time  after  two  rotations. 
Inspection  of  the  loadings  of  those  tests  common  to  the 
batteries  suggests  a  high  degree  of  invariance  in  the 
dimensions  obtained  and  in  the  factor  saturations  of 
these  variables.  This  is  clearly  brought  out  in 
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TABLE  X 

CORRELATION  MATRIX  OF  ELEVEN  TEST  SCORES 


(N  =  271) 
Variable 


(Decimal  points  omitted) 

1_ 2  3  4  5  6  7 


S 


10  11 


1.  Raven  Matrices 

2.  Cattell  g 

3.  Lorge— Thorndike 

4.  CTMM  Language 

5.  CTMM  Non -Lang. 

6.  Laycock 

7.  Holz-Crowder 

Series 

8.  Holz-Crowder 

Fig.  Ch. 

9.  Calif.  Rdg. 

10.  Calif.  Arith. 

11.  Calif.  Eng. 


(620)K 
637  (645) 

629  614  (666) 

473  473  549  (727) 

577  505  605  443  (515) 

507  498  601  732  431  (741) 

358  387  531  492  460  451  (375) 

494  457  470  434  384  428  451  (375) 

425  356  467  772  351  713  486  419  (800) 

435  369  532  562  425  550  504  393  629  (574) 

381  297  408  619  353  642  466  363  733  598  (660) 


Final  estimate  of  the  communality. 
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TABLE  XI 

ROTATED  FACTOR  SOLUTION  AND  COMMUNALITIES 
FOR  ELEVEN  TOTAL  TESTS 


(Loadings  below  .20  omitted) 

Variables  I  jjj  h2 


Raven  Matrices 

.730 

- 

- 

.634 

Cattell  g 

.792 

- 

- 

.639 

Lorge -Thorndike 

.740 

- 

.325 

.673 

CTMM  Language 

.530 

.  616 

- 

.720 

CTMN  Non-Lang. 

.624 

- 

.349 

.516 

Laycock 

.634 

.573 

- 

.730 

Kolz-Crowder  Series 

.399 

to 

0^\ 

CV\ 

. 

.531 

.556 

Holz-Crowder  Fig. 

Changes 

.521 

- 

.256 

.374 

Calif.  Reading 

.  444 

.784 

- 

.319 

Calif.  Arithmetic 

o  430 

.509 

.343 

.562 

Calif.  English 

.361 

.702 

- 

.654 

%  of  Common  Variance  55.97%  32. S3 %  11.20%  100.00% 


%  of  Total  Variance  35.02%  20.55%  7.01%  62.57% 
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Table  XII,  which  shows  that  the  mean  discrepancy  in 
corresponding  loadings  is  less  than  .06. 


TABLE  XII 

COMPARISON  OF  FACTOR  LOADINGS  OF  TESTS  COMMON 
TO  SUB-TEST  AND  TOTAL  TEST  BATTERIES 


Variable 

Battery 

S 

v:ed 

n 

Raven  Matrices 

1st 

.  70S 

- 

- 

2nd 

.730 

- 

- 

Cattell  g 

1st 

.754 

- 

- 

2nd 

.792 

- 

- 

Laycock 

1st 

.676 

.509 

- 

2nd 

.634 

.573 

- 

Holz-Crowder  Series 

1st 

.457 

.214 

.453 

2nd 

.399 

.338 

.531 

Holz-Crowder  Fig. 

1st 

.493 

- 

.211 

Changes 

2nd 

.521 

— 

.256 

<= 
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The  labelling  of  the  factors  in  the  eleven  test 
battery  was  facilitated  by  the  sub-test  analysis. 

Once  again,  g,  which  accounted  for  fifty-six  per  cent 
of  the  common  variance,  was  very  large.  As  before,  it 
showed  highest  loadings  on  the  same  culture-reduced 
tests.  The  v:ed  dimension  was  second  in  importance 
with  highest  loadings  on  verbal  intelligence  and 
school  achievement  tests.  Factor  III  corresponded 
to  the  numerical  factor  in  the  first  analysis,  since 
it  showed  the  highest  loadings  on  a  series  test,  and 
on  batteries  which  included  arithmetic  and  series 
sub-tests . 

Summary.  It  is  clear  from  these  two  analyses 
that  the  selected  culture-reduced  tests  of  intelligence 
can  be  interpreted  as  measures  of  one  broad  factor  of 
intellectual  ability  which  enters  into  many  mental 
tests.  Only  the  series  tests  appear  to  be  measuring  a 
common  factor  apart  from  g,  a  fact  which  is  readily 
understood  by  inspection  of  the  tests. 

In  each  analysis  the  most  sensitive  measures  of 
the  general  factor  are  the  Raven  Matrices,  CattellTs 
Test  of  g,  and  the  Figure  Analogies  sub-test  of  the 
Lorge-Thorndike  Test.  No  test  of  significance  can 
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be  made  to  investigate  differences  between  loadings, 
but  since  these  tests  indicated  consistent  superio¬ 
rity,  they  were  examined  carefully  for  conformity 
to  further  criteria  of  evaluation. 
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CHAPTER  IX 


RELATIONSHIP  WITH  SOCIO-ECONOMIC  STATUS 

To  examine  the  second  hypothesis,  that  the 
culture-reduced  measures  of  intelligence  were  more 
independent  of  socio-economic  status  than  were  the 
culture-loaded  tests,  two  approaches  were  taken. 
Correlations  with  social  status  were  compared  for 
significant  differences,  and  mean  T  scores  for  five 
separate  socio-economic  levels,  on  each  test,  were 
tabulated  and  graphed  to  illustrate  trends  and 
irregularities  in  the  relationship. 

Correlations  with  Socio-Economic  Status.  Corre¬ 
lation  coefficients  were  calculated  by  the  Pearson  Pro¬ 
duct-Moment  method  once  the  required  assumptions  for 
the  variables  tested  were  shown  to  be  adequately  met. 

The  occupational  index  was  already  in  standard  score 
form,  but  since  the  distribution  was  skewed  to  the  right, 
the  scores  were  normalized  before  correlation.  The 
Home  Index  raw  scores  showed  a  normal  distribution  as 
did  the  combined  socio-economic  score.  Other  tests 
showing  skewness  were  tested  for  normality  by  means  of 

a  X2 


test.  As  a  result  the  spatial  and  logical  sub- 
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tests  of  the  CTMM  were  found  to  be  significantly  ab¬ 
normal.  This  finding  was  attributed  to  the  low  ceil¬ 
ing  of  these  two  sub-tests  for  the  present  sample. 

Since  all  other  variables  were  normally  distributed, 
it  seemed  reasonable  to  assume  that  these  two  tests 
would  have  shown  normal  distributions  if  more  diffi¬ 
cult  items  had  been  included.  Therefore,  the  two  tests 
were  converted  to  T  scores  before  correlation.  All 
other  tests  revealed  normal  distributions  in  raw  score 
form. 

When  an  attempt  was  made  to  examine  the  effects 
of  these  normalization  procedures  on  subsequent  calcu¬ 
lations,  it  was  found  that  correlations  between  each 
of  thirty  variables  and  the  normalized  variables  were 
all  within  .05  of  the  corresponding  correlations  with 
the  raw  data.  In  fact,  the  majority  were  within  .01 
of  the  corresponding  coefficient.  Apparently,  normal¬ 
ization  had  little  effect  on  the  results. 

Another  approach  to  the  same  problem  was  taken 
by  calculating  the  value  of  eta  for  the  correlation 
between  CTMM  Language  scores  and  Occupation  raw  scores. 
Since  the  value  obtained  (.349),  was  very  close  to  the 
corresponding  Pearson  r  (.333)  between  CTMM  Language 
score  and  Occupation  T  score,  it  was  again  assumed 
that  normalization  had  made  little  difference. 
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The  Pearson  r’s  computed  between  the  two  social 
status  variables  and  their  combined  scores,  the  final 
measure  of  socio-economic  status,  are  presented  in 
Table  XIII.  Inspection  of  this  Table  shows  the  close 
correspondence  between  the  first  two  columns  (rho  =  -f  „$7), 
although  the  Home  Index  correlations  are  slightly  higher 
in  most  cases.  To  facilitate  interpretation  of  the 
table  in  the  light  of  the  second  hypothesis,  culture- 
reduced  intelligence  tests  and  the  two  spatial  tests 
were  listed  together  in  the  left  half  of  the  table, 
and  culture-loaded  tests  appear  on  the  right.  The 
general  pattern  of  lower  correlations  between  socio¬ 
economic  variables  and  culture-reduced  tests  can  be 
determined  by  comparing  them  with  the  corresponding 
columns  on  the  right.  Clearly,  spatial  tests  show  the 
least  relationship,  but  as  they  failed  to  meet  the 
other  criteria  of  culture-reduced  intelligence  tests, 
they  were  considered  unsuitable  for  futher  study. 

These  tests  were  the  only  ones  which  failed  to  show 
significant  correlations  with  socio-economic  status. 

To  test  the  significance  of  the  observed  cor¬ 
relation  differences  between  the  two  halves  of  the 
Table,  the  following  formula  cited  by  McNemar  (1955) 
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PEARSON  CORRELATION  COEFFICIENTS  BETWEEN  THREE  MEASURES  OF 
SOCIO-ECONOMIC  STATUS  AND  SELECTED  CULTURE-REDUCED 
AND  CULTURE-LOADED  TESTS 
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was  used : 

( ''12  -  r13)y/  (N  -  3)  (l.  r23) 

/s  (1  -  i-12  -  r13  -  1*23  +  2r12  rl3  r23^ 

This  test  is  appropriate  for  comparing  rT s  in  a  situation 
where  two  variables  are  correlated  with  a  common  third  on 
the  same  sample,  since  allowance  is  made  for  the  fact 
the  the  two  correlations  are  not  independently  distrib¬ 
uted  in  successive  samplings. 

Thus,  nine  culture-reduced  and  six  typical  cul¬ 
ture-loaded  tests  were  selected,  and  t  values  for 
differences  in  their  relationship  with  socio-economic 
status  were  tested  and  tabulated,  as  in  Table  XIV. 

In  this  table,  forty-two  of  the  fifty-four 
obtained  t  values  are  significant  beyond  the  ,05  level. 

A  one-tailed  test  was  used,  since  it  had  been  hypothe¬ 
sized  that  the  verbal  test  correlations  would  be  higher. 
Half  of  the  non-significant  values  are  accounted  for  by 
the  CTMM  Numerical  sub-test,  which  is  partly  non-verbal, 
while  four  of  the  remainder  are  attributable  to  the  Hoi- 
&inger-Crowder  Series  Test,  which  contains  items  requir¬ 
ing  familiarity  with  alphabetical  relationships.  If 
these  two  variables  were  omitted,  only  two  of  the  remain¬ 
ing  forty  differences  fail  to  meet  the  .05  significance 
level.  Clearly,  the  hypothesis  can  be  accepted  that 
culture-loaded  tests  differentiate  between  socio-economic 
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VALUES  OF  t  FOR  DIFFERENCES  IN  CORRELATION 
COEFFICIENTS  BETWEEN  SOCIO-ECONOMIC 
STATUS  AND  SELECTED  TESTS 
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levels  more  than  do  culture  reduced  tests. 

Examination  of  the  marginal  rows  of  Table  XIV 
indicates  that  the  CTMM  Language  test  has  the  highest 
relationship  with  socio-economic  status,  while  the 
addition  of  the  Non-Language  total  is  insufficient  to 
affect  the  relationship  very  much.  The  high  correla¬ 
tion  and  t  value  for  the  Verbal  sub-test  suggests  that 
most  of  the  cultural  loading  is  associated  with  this 
test.  The  Laycock  test,  too,  shows  significant  t 
values  in  all  but  one  of  the  rows,  and  comparison  with 
the  CTMM  results  indicates  that  the  difference  in  cul¬ 
tural  loading  between  the  two  tests  could  well  be  due 
to  chance. 

The  marginal  columns  show  that  the  mental  tests 
which  discriminate  least  between  social  classes  are 
the  first  two  sub-tests  of  the  Lorge -Thorndike  Non- 
Verbal  test  and  the  CTMM  Non-Language  Total.  Since 
all  three  of  these  measures  have  relatively  low  g 
loadings  (.5$,  .55,  and  .63  respectively)  and  the 
second  two  are  measuring  also  the  numerical  factor, 
their  value  as  culture-reduced  measures  of  general 
intellectual  ability  is  not  impressive.  The  Holzinger- 
Crowder  Figure  Changes  test  is  seen  to  be  unsatisfac¬ 
tory  for  the  same  reasons. 
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Of  the  remaining  tests,  the  Progressive  Mat¬ 
rices  has  the  least  relationship  with  socio-economic 
status  {,228),  although  it  surpasses  the  Cattell  test 
(.244)  by  only  a  small  margin.  The  Figure  Analogies 
sub-test  of  the  Lorge -Thorndike  test  is  slightly  less 
promising  (.262),  but  the  instrument  shows  a  suffici- 
ciently  high  g  loading  (.74)  to  merit  further  study. 

It  might  be  argued  that  a  really  "culture-fair" 
test  would  show  zero  correlations  with  socio-economic 
status.  However,  the  present  investigator  has  assumed 
that  the  combined  effects  of  heredity  and  early  envi¬ 
ronment,  coupled  with  the  cultural  influences  which 
operate  in  any  booklet  testing  situation  are  suffici¬ 
ent  to  produce  some  superiorit}r  for  children  from  the 
upper  classes.  Hence,  it  is  argued  that  the  correla¬ 
tions  shown  between  the  three  tests  listed  above  are 
low  enough  to  suggest  a  significant  amount  of  culture- 
reduction,  and  therefore  a  more  suitable  test  for  all 
socio-economic  classes.  These  three  instruments,  the 
Progressive  Matrices,  the  "Culture-Free"  test  of  g, 
and  the  Lorge-Thorndike  Figure  Analogies  test,  conform 
well  to  the  first  two  criteria  of  culture-reduced  tests 
of  intelligence. 
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Per  '  nee  at  )ifferent  Socio-Economic  Levels. 


A  second  approach  to  the  examination  of  the  relation¬ 
ship  between  socio-economic  status  and  abilities  was 
taken  by  converting  the  raw  scores  of  each  intelli¬ 
gence  test  to  T  scores,  and  comparing  the  means  of 
each  socio-economic  fifth  on  all  tests. 

The  distributions  were  standardized  to  a  common 
mean  of  50  and  standard  deviation  of  10  in  order  to 
facilitate  the  plotting  and  comparison  of  each  test. 

Then  the  sample  of  two  hundred  seventy-one  children  was 
divided  into  five  groups  of  approximately  equal  size  on 
the  basis  of  their  socio-economic  score.  The  division 
was  made  arbitrarily  on  psychometric  grounds  only  so 
that  no  equation  with  separate  identifiable  social 
classes  would  appear  meaningful.  It  is  quite  conceiv¬ 
able  that  social  contacts  are  just  as  numerous  between 
such  social  levels  as  they  are  within  them. 

The  T  score  means  and  standard  deviations  for 
the  major  intelligence  tests  are  set  out  in  Table  XV 
and  for  the  sub-tests  in  Table  XVI.  These  Tables 
illustrate  in  a  different  way  the  same  findings  as  the 
correlation  comparisons  presented  in  Table  XIV.  The 
culture-reduced  tests  show  less  differentiation  between 
socio-economic  levels  than  do  the  Laycock  and  CTMM 

Language  tests.  Thus  the  absolute  discrepancies  between 
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means  for  the  highest  and  lowest  groups  on  the  Raven, 
Cattell  and  CT1YLM  Non-Language  tests  are  6.67,  7.55, 
and  5.33  respectively,  while  the  Laycock  and  CTMM  Lan¬ 
guage  tests,  by  contrast,  show  corresponding  discrep¬ 
ancies  of  10.20  and  11.34*  These  relationships  are 
brought  out  more  clearly  in  Figures  1  and  2,  which  show 
the  discrepancies  in  graphical  form. 

It  can  be  seen  from  these  tables  and  graphs  that 
the  majority  of  tests  show  a  regular  linear  relationship 
with  socio-economic  status.  Conspicuous  exceptions  to 
this  pattern  are  the  two  spatial  sub-tests,  and  the  CTMM 
Non-Language  Total,  which  is  heavily  weighted  with  a 
spatial  test.  Spatial  ability  has  previously  been  shown 
to  have  little  relationship  with  social  class  (Havighurst 
and  Bre  se ,  1947);  therefore,  the  irregular  pattern  of 
means  is  not  unexpected  with  these  tests.  The  reason 
for  non-linearity  in  the  case  of  the  two  Lorge-Thorndike 
and  Holzinger-Crowder  sub-tests  is  not  clear,  but  may 
well  be  due  to  test  unreliability. 

The  similarity  in  the  patterns  of  the  California 
and  Laycock  Intelligence  Tests  is  close  enough  to  merit 
attention.  On  the  present  sample,  and  with  the  socio¬ 
economic  variables  used,  it  is  apparent  that  performance 
on  verbal  intelligence  tests  is  more  closely  related  to 
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socio-economic  status  at  the  upper  than  at  the  lower 
end  of  the  socio-economic  scale.  In  fact,  several 
cultural-loaded  tests  were  unable  to  differentiate 
clearly  between  the  third  and  fourth  socio-economic 
categories.  This  observation  suggests  either 

a.  a  similarity  in  the  environmental  stimulation 
to  which  the  children  in  these  two  categories  are  sub¬ 
jected  when  this  stimulation  is  judged  by  verbal  and 
numerical  tests,  or 

b.  the  importance  of  some  uncontrolled  variable 
near  the  lower  end  of  the  socio-economic  scale;  for 
example,  xiiller  and  Swanson’s  (1953)  bureaucratic  - 
entrepreneurial  dichotomy. 

No  conclusions  have  been  reached  about  the  rela¬ 
tive  importance  of  such  variables. 

A  contrast  can  be  drawn  between  the  foregoing 
findings  and  those  of  an  earlier  investigation.  On 
the  basis  of  his  American  study,  Eells  (1951)  concluded 
that  the  least  discrimination  appeared  at  the  upper  end 
of  the  social  scale.  The  fact  that  the  present  study 
shows  greater  discrimination  in  the  top  three  fifths  is 
probably  a  reflection  of  the  difference  in  both  the 
social  structure  of  the  Edmonton  community  and  the  dif¬ 
ferent  measuring  scale  used  in  this  study.  As  it  was 
pointed  out  previously,  Warner’s  upper-upper  class  is 
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not  prominent  in  Edmonton;  hence,  the  upper  end  of  the 
socio-economic  scale  in  this  investigation  is  probably 
equivalent  to  the  lower-upper  and  upper-middle  classes 
of  Earner’s  scale.  The  results  of  the  present  research 
are  more  in  line  with  studies  based  on  occupational 
levels  classified  according  to  professional,  skilled, 
unskilled  workers,  and  so  on.  (For  example,  Terman 
and  Merrill,  1937) 

To  conclude  this  chapter,  it  has  been  shown 
that  on  the  present  sample  verbal  intelligence  tests 
showed  significantly  higher  correlations  with  socio¬ 
economic  status  than  the  selected  culture-reduced 
measures.  Spatial  and  Figure  Analogies  tests  showed 
lowest  correlations  with  socio-economic  status,  but 
the  relationships  were  non-linear.  The  Raven  and 
Cattell  tests  also  showed  low  correlations  while  the 
pattern  of  the  relationship  was  similar  at  each  socio¬ 
economic  level. 


CHAPTER  X 


CHANGES  IN  CORRELATIONS 
OVER  FOUR  YEARS 

Assumptions  of  the  Longitudinal  Hypothesis 0  If 
socio-economic  status  were  an  ongoing  environmental 
influence  on  test  scores,  it  could  be  expected  to 
increase  its  effects  as  the  testees  grew  older.  To 
test  this  assumption  Hypothesis  III  predicted  that  the 
correlation  between  socio-economic  status  and  a  culture 
loaded  test  (the  CTMM)  would  increase  significantly 
between  Grades  III  and  VII.  During  these  four  years 
the  variables  reflected  in  the  social  status  scale  were 
presumably  operating  differentially  on  the  mental 
development  of  the  children  of  each  social  class. 

On  the  other  hand,  a  test  which  merits  the 
adjective  Ttculture-reducean  should  show  correspondingly 
little  increase  in  correlation  with  socio-economic 
status  during  the  same  period.  The  hypothesis  stated 
that  the  Progressive  Matrices  test,  which  was  adminis¬ 
tered  in  both  surveys,  would  show  similar  correlations 
with  socio-economic  status  at  both  grade  levels. 

The  third  sub-hypothesis,  designed  to  investigat 
the  same  trends  by  another  method , required  the  changes 
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in  T  score  on  the  culture-loaded  test  to  show  higher 
correlations  with  socio-economic  status  than  T  score 
changes  on  the  culture-reduced  test. 

Since  only  one  intelligence  test  of  each  degree 
of  cultural  loading  was  administered  in  the  Grade  III 
survey,  the  hypotheses  of  this  chapter  were  concerned 
with  two  tests  only,  the  CTMM  and  the  Progressive 
Matrices.  Generalization  to  other  tests  is  naturally 
limited  by  this  restriction,  but  the  similarity  of  these 
two  instruments  to  the  Laycock  and  Gattell  tests  respec¬ 
tively,  in  terms  of  dimensions  measured  and  relation¬ 
ships  with  socio-economic  status,  has  already  been  pointed 
out.  Conclusions  about  other  tests  in  the  battery  must 
necessarily  await  further  study. 

Another  restrictive  assumption  of  these  hypotheses 
is  that  the  tests  used  are  sampling  similar  abilities  at 
both  ages.  Inspection  of  the  tests  suggests  that  this 
assumption  is  reasonable,  for  the  Raven  Matrices  con¬ 
tains  twenty-four  items  in  each  test  which  could  be 
considered  almost  identical,  apart  from  the  element  of 
colour.  The  remaining  items  are  very  similar  to  those 
common  to  both  tests.  The  sub-tests  of  the  CTMM  are 
also  very  similar  at  both  levels,  but  these  similarities 
are  no  guarantee  that  the  tests  do  not  measure  different 
functions.  For  instance,  an  unfamiliar  verbal  item  at 
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Grade  III  may  be  solved  by  logical  deduction;  at  Grade 
VII,  when  the  pupil  has  had  more  acquaintance  with  the 
contents  of  the  item,  it  may  be  chiefly  a  matter  of 
memory.  If  environmental  influences  are  reduced  to  a 
minimum,  however,  as  we  have  hypothesized  in  the  Raven 
tests,  it  seems  probable  that  any  changes  in  mental 
function  required  would  be  of  lesser  magnitude  for  simi¬ 
lar  items.  If  this  is  the  case,  scores  on  the  culture- 
reduced  test  are  still  likely  to  show  less  variability 
over  four  years  than  scores  on  the  culture-loaded  test. 
Consequently,  the  effects  of  an  invalid  assumption  in 
the  case  of  the  CTMM  will  not  be  very  serious  in  test¬ 
ing  this  hypotbsis.  In  addition,  it  should  be  pointed 
out  that  test  scores  at  Grade  III  and  Grade  VII  are 
commonly  interpreted  as  directly  comparable,  and  are 
used  to  calculate  intelligence  quotients  at  both  levels. 
It  is  still  important,  then,  to  note  changes  in  the 
relationship  between  socio-economic  status  and  CTMM 
scores,  whether  the  intelligence  quotient  calculated 
from  these  scores  is  a  composite  of  the  same  abilities 
at  each  level  or  of  a  different  set.  School  authorities 
usually  interpret  it  as  being  the  same. 

Some  light  may  be  shed  on  the  justification  for 
the  assumption  that  similar  abilities  were  tested  at 
both  ages,  by  pointing  out  that  the  correlation  between 
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the  two  Matrices  tests,  administered  exactly  four  years 
apart,  was  *545,  while  the  corresponding  coefficient 
for  the  CTMM  Total  was  .471.  Apparently,  the  amount  of 
the  common  variance  in  the  two  Raven  tests  is  only 
slightly  greater  than  the  variance  common  to  both  CTMM 
tests.  Now,  if  some  of  the  variance  not  attributable 
to  the  relationship  between  Grades  III  and  VII  results 
can  be  accounted  for  by  socio-economic  status  in  the 
case  of  the  CTMM  test  then  the  hypothesis  of  cultural 
influence  gains  added  support.  If  the  corresponding 
variance  in  the  Raven  Tests  is  not  to  be  attributed  to 
socio-economic  variables,  it  must  presumably  be  due  to 
chance  or  some  unidentified  factor.  The  point  made  here, 
is  that  both  instruments  showed  a  sufficiently  low  re¬ 
test  correlation  for  the  influence  of  those  socio¬ 
economic  variables  which  operated  between  testing 
occasions  to  show  their  effects.  Moreover,  the  oppor¬ 
tunity  for  them  to  do  so  is  similar  in  each  case. 

Results.  The  sample  used  for  testing  this  hypo¬ 
thesis  was  smaller  than  the  main  sample  of  the  investi¬ 
gation.  Of  the  four  hundred  thirty-two  pupils  tested 
in  Grade  VII  in  I960,  one  hundred  fifty-six  were  not 
living  in  Edmonton  when  the  Grade  III  tests  were  admin¬ 
istered  in  1956,  and  another  twenty-three  missed  one  or 
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more  tests  at  Grade  VII.  This  left  two  hundred  forty- 
three  pupils  for  the  testing  of  this  hypothesis. 

Whether  the  change  in  sample  affected  the  results  can 
be  seen  by  comparing  results  common  to  both  samples. 

As  the  data  for  the  Grades  III  to  VII  comparisons  were 
normalized  before  correlation,  only  the  Home  Index  var¬ 
iable  could  be  compared  on  the  basis  of  mean  and  standard 
deviation.  These  figures  showed  a  difference  between 
means  of  .34  and  a  difference  between  standard  devia¬ 
tions  of  .01.  Since  neither  figure  approaches  signifi¬ 
cance,  the  samples  are  considered  equivalent  on  this 
variable.  Further  comparisons  were  made  by  pairing  the 
computed  correlation  coefficients  common  to  each  sample. 
These  are  reproduced  in  Table  XVII. 

Since  the  largest  discrepancy  in  this  Table,  that 
between  the  Matrices  and  socio-economic  status,  yields 
a  critical  ratio  of  only  1.4,  and  since  the  median  dif¬ 
ference  between  corresponding  correlations  is  only  .036, 
the  problem  of  sampling  bias  was  not  thought  to  be 
serious.  The  one  substantial  discrepancy  was  carefully 
checked  and  investigated  further,  as  the  relationship 
was  considered  an  important  one  in  this  study.  It  was 
found  that  the  fifty-eight  students  omitted  from  the 
cross-sectional  sample,  but  included  in  the  longitudinal 
one,  showed  a  correlation  coefficient  of  only  .056  between 
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Matrices  T  score  and  socio-economic  status,  while  the 
corresponding  figure  for  the  pupils  included  in  the 
cross-sectional  but  not  the  longitudinal  study  was  as 
high  as  .33$.  In  other  words,  the  students  included  in 
one  sample  only  were  atypical  in  this  one  relationship, 
but  no  other.  Since  the  observed  discrepancy  between 
the  two  large  samples  was  insignificant,  the  change  was 
attributed  to  chance.  The  effects  of  normalizing  the 
data  before  correlation  were  also  checked  and  found  to 
be  negligible. 

Testing  of  Hypothesis  III  was  performed  by  corre¬ 
lating  socio-economic  status  with  Grade  III  and  Grade 
VII  data,  and  with  changes  in  T  scores  between  corres¬ 
ponding  tests  at  each  level.  The  significance  of 
differences  between  correlation  coefficients  was  calcu¬ 
lated  by  the  same  procedure  as  for  Hypothesis  II,  and 
one-tailed  tests  of  significance  were  used  in  each  case. 
'The  results  for  the  first  two  sub-hypotheses,  presented 
in  Table  XVIII  show  that  for  total  test  scores,  each 
prediction  was  correct.  Thus  the  Progressive  Matrices 
showed  a  negligible  change  in  correlation  with  socio¬ 
economic  status  between  Grades  III  and  VII,  while  CTMM 
Total  score  showed  a  significant  increase.  Inspection 
of  the  three  component  scores  on  the  right  of  Table  XVIII 
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TABLE  XVIII 

PEARSON  CORRELATION  COEFFICIENT  BETWEEN  SOCIO¬ 
ECONOMIC  STATUS  AND  FIVE  INTELLIGENCE  TESTS 
AND  SUB TESTS  ADMINISTERED  ON  TWO 
OCCASIONS  FOUR  YEARS  APART 


(N  =  243) 

Grade 

S.E.S. 

and 

Matrices 

S.E.S. 

and 

CTMM 

Total 

S.E.S. 

and 

CTMM 

Lang. 

S.E.S. 

and 

CTMM 

Non-Lang. 

S.E.S. 

and  CTMM 

Ve rbal 

III. 

.035 

.215 

.276 

.094 

.220 

VII. 

.100 

.323 

.350 

.136 

.320 

Difference 

.015 

.108 

.074 

.042 

.100 

t 

0.15 

1.89 

1.19 

0.57 

1.33 

Signif . 

N.S. 

.05 

N.S. 

N.S. 

N.S. 

. 
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suggests  that  the  Verbal  sub-test  contributed  a  large 
proportion  of  the  increase  in  correlation,  although 
none  of  the  sub-test  differences  was  large  enough  to 
be  significant. 

The  third  deduction,  that  socio-economic  status 
should  show  a  higher  correlation  with  change  in  T  score 
for  the  CTMM  than  for  the  Matrices  tests  was  examined 
by  the  same  procedure.  Table  XIX  shows  that  each  result 
was  in  the  expected  direction,  but  that  the  differences 
failed  to  reach  the  significance  level.  Evidently, 
this  approach  is  less  sensitive  to  change  with  age  than 
is  the  straightforward  comparison  of  correlation  coef¬ 
ficients  at  each  age  level.  In  fact,  only  the  corre¬ 
lation  between  CTMM  Verbal  Change  and  socio-economic 
status  is  large  enough  to  be  significantly  different 
from  zero. 

Nevertheless,  the  confirmation  of  the  first  two 
predictions  of  Hypothesis  III  indicates  that  the  Prog¬ 
ressive  Matrices  test  is  less  susceptible  than  the  CTMM 
to  the  cumulative  effects  of  different  socio-economic 
variables  which  may  influence  the  mental  development  of 
Edmonton  children  over  a  period  of  four  years. 
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TABLE  XIX 

PEARSON  CORRELATIONS  BETWEEN  SOCIO-ECONOMIC 
STATUS  AND  CHANGE  IN  T  SCORE  OVER  FOUR 
YEARS  ON  FIVE  INTELLIGENCE 
TESTS  AND  SUB-TESTS 


(N  =  243) 

Test 

Raven 

Matrices 

CTMM 

Total 

CTMM 

Lang. 

CTMM 

Non- 

Lang. 

CTMM 

Verbal 

r  S.E.S . 
and 

T  §core 
Change 

.025 

.101 

.051 

<,015 

.111 

- 


. 

-■ 
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CHAPTER  XI 


ITEM  ANALYSIS  IN  RELATION  TO 
SOCIO-ECONOMIC  STATUS 

Procedure .  To  examine  the  fourth  hypothesis, 
that  a  culture-reduced  test  would  show  a  more  regular 
pattern  of  item  discrimination  across  socio-economic 
levels  than  a  culture-loaded  measure,  two  instruments 
typical  of  both  kinds,  were  selected  for  analysis. 

These  were  the  Standard  Progressive  Plat  rices  and  the 
CTMM  (Elementary)  Verbal  sub-test.  Test  papers  were 
separated  into  three  equal  groups,  on  the  basis  of 
socio-economic  score,  and  the  answers  of  pupils  in 
the  high  third  were  compared  with  the  corresponding 
answers  of  the  lowest  third.  Ninety  pupils  were  inclu¬ 
ded  in  each  status  group. 

The  Verbal  test  contained  fifty  multiple  choice 
items,  but  fifteen  were  eliminated  from  the  analysis 
because  more  than  ninety  per  cent  or  fewer  than  ten 
per  cent  of  the  pupils  scored  them  successfully.  Eells 
(1951)  showed  in  a  similar  study,  that  indices  for 
items  such  as  these  at  the  extreme  ends  of  the  diffi¬ 
culty  scale  are  very  unreliable.  Por  the  same  reason, 
only  twenty-eight  of  the  items  in  the  Progressive 
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Matrices  test  were  retained  for  analysis. 

The  percentages  of  pupils  from  each  status  group 
who  passed  the  retained  items  were  calculated  for  each 
test  and  have  been  recorded  in  Table  XX.  Although 
very  easy  items  had  been  eliminated  beforehand,  it  is 
clear  from  this  Table  that  the  average  item  difficulty 
level  was  still  above  fifty  per  cent.  Nevertheless, 
the  range  of  percentages  is  wide  enough  to  ensure  that 
discrimination  indices  are  not  distorted  by  overweight¬ 
ing  any  one  difficulty  level.  Differences  between  per¬ 
centages  are  not  comparable  from  one  level  of  difficulty 
to  another. 

To  reduce  the  effects  of  this  problem,  percent¬ 
age  values  obtained  for  each  item  were  compared  by 
means  of  the  Davis  Discrimination  Index  (Lindquist, 

1951) .  This  scale  is  a  development  of  Flanagan’s 
Discrimination  Index,  which  consists  of  correlations 
between  responses  to  an  item  and  total  test  scores. 

The  Davis  scale  has  the  advantage  of  being  based  on 
the  z  transformation  of  the  correlation  coefficients, 
thus  providing  units  which  are  approximately  equal 
throughout.  Such  indices  are  subject  to  the  statis¬ 
tical  manipulation  required  to  reveal  differences  in 
discrimination  patterns. 
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TABLE  XX 

DISTRIBUTION  OF  SELECTED  TEST  ITEMS  BY 
DIFFICULTY  OF  THE  ITEM  FOR  TWO 
SOCIO-ECONOMIC  GROUPS 


Per  Cent 
of  Group 
Correct 

CTMM  Verbal 

Progressive  Matrices 

Upper  Third 

Lower  Third 

Upper  Third 

Lower  Third 

Above  90 

5 

2 

SO  -  S9 

6 

2 

10 

2 

70  -  79 

3 

7 

4 

11 

60  -  69 

4 

3 

3 

3 

50  -  59 

4 

4 

5 

3 

40  -  49 

4 

2 

4 

30  -  39 

4 

5 

1 

1 

20  -  29 

2 

5 

3 

2 

10  -  19 

3 

6 

2 

Below  10 

1 

Total  Number 

of  Items 

35 

35 

2B 

2S 

Mean  Per  Cent 

59*  /4 

45.75 

69.64 

59.00 

'  L  si  ■ 
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Results .  The  discrimination  indices,  which  were 


read  from  Colver’s  Nomograph  (Colver,  1959)  are  tabu¬ 
lated  in  Table  XXI. 


TABLE  XXI 


DISTRIBUTION  OF  INDICES  OF  DISCRIMINATION 
BE  WEEN  WO  SOCIO-ECONOMIC 
STATUS  GROUPS 


Discrimination  CTMM  Raven 

Index  Verbal  Matrices 


24  -  26 

1 

21  -  23 

18  -  20 

4 

15  -  17 

8 

12  -  14 

6 

6 

9-11 

5 

9 

6-8 

4 

5 

3  -  5 

5 

6 

o 

i 

ro 

1 

2 

-  3  -  -1 

1 

N 


35 


28 


. 

• 

- 
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1’he  hypothesis  predicted  that  the  culture-reduced 
test  would  show  a  more  regular  pattern  of  item  discrim¬ 
ination  between  social  status  levels;  for  instance, 
that  the  discriminatory  powers  from  item  to  item  would 
vary  significantly  less  than  the  discrimination  indices 
of  the  verbal  test.  Inspection  of  Table  XXI  reveals 
that  some  items  of  the  CTMM  test  differentiate  markedly 
between  socio-economic  groups,  while  others  discrimi¬ 
nate  very  little  or  not  at  all.  The  corresponding 
values  for  the  Matrices  test  show  very  little  variation,. 
The  significance  of  these  differences  was  tested 
by  means  of  an  F  test  on  the  variances  of  the  two  dis¬ 
tributions  of  indices.  Table  XX II  shows  that  these 
variances  were  significantly  different  at  the  .05  level 
when  a  one-tailed  test  was  used. 

A  t  test  was  also  performed  on  the  means  of  the 
discrimination  indices  and  another  significant  differ¬ 
ence  obtained.  This  result  merely  confirms  what  the 
correlation  findings  of  Hypothesis  II  indicated,  that 
the  verbal  test  discriminated  more  between  socio¬ 
economic  levels  than  the  Progressive  Matrices. 

The  significant  difference  in  variability  of 
the  indices  is  the  important  finding  of  this  Table.  It 
suggests  that,  in  relation  to  the  upper  classes,  chil¬ 
dren  from  the  lower  social  status  levels  find  some 
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TABLE  XXII 


MEANS  STANDARD  DEVIATIONS  AND  SIGNIFICANCE 
LEVELS  FOR  INDICES  OF  DISCRIMINATION 
BETWEEN  TWO  SOCIO-ECONOMIC 
STATUS  GROUPS 


CTMM 

Verbal 

Prog. 

Matrices 

t 

Signif- 
F  icance 

Number 

of 

Items 

35 

2S 

- 

- 

- 

Mean 

11.34 

£.36 

2.31 

.05 

S.D. 

5.94 

4*  04 

- 

2.16 

.05 

- 


G.  . 
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- 
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items  more  difficult  than  others.  Such  a  result  is 
compatible  with  the  assertion  that  certain  items  in 
the  verbal  scale  are  biased  against  lower  class  chil¬ 
dren.  Caution  is  necessary  in  drawing  any  such  defi¬ 
nite  conclusion,  however,  since  it  may  be  that  the 
verbal  items  differ  also  in  the  extent  to  which  they 
tap  the  main  dimension  of  the  test. 

Examination  of  the  individual  items  which  dis¬ 
criminate  most  between  social  levels  suggests  that  each 
may  well  require  a  knowledge  of  abstract  terms  which 
are  more  familiar  in  a  home  where  parents  have  some  high 
degree  of  verbal  sophistication.  Such  a  speculation  is 
in  line  with  Eells’  (1951)  findings  about  the  nature  of 
the  items  in  which  social  class  discrimination  is  great¬ 
est.  No  proof  of  this  speculation  seems  possible  however; 
If  it  is  the  case,  and  the  empirical  finding  of  greater 
variability  in  item  discrimination  makes  it  plausible, 
it  could  be  argued  that  the  items  with  greatest  discrep¬ 
ancies  between  social  levels  are  biased  against  the 
lower  social  groups.  As  tests  of  present  vocabulary, 
their  validity  is  unimpaired,  but  as  an  indication  of 
the  student’s  intellectual  potential,  they  are  open  to 
criticism. 

Table  2XIII  presents  examples  of  the  most  and 
least  discriminating  items,  to  enable  the  reader  to  draw 
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TABLE  XXIII 


EXAMPLES  OF  MOST  AND  LEAST  DISCRIMINATING 
VOCABULARY  ITEMS  FROM  THE 
CTMM  VERBAL  SUB -TEST 


Vocabulary 

Item 

! 

Per  Cent 
Passing 
in  Upper 
Third 

Per  Cent 
Passing 
in  Lower 
Third 

Difference 
in  Per  Cent 
Favouring 
Upper  Third 

Most 

Significance 

65 

35 

30 

Discri- 

Complex 

57 

23 

29 

minat- 

ing 

Obscure 

41 

15 

26 

Malign 

41 

20 

21 

Least 

Dispatch 

11 

14 

-3 

Discri- 

Console 

39 

35 

4 

minat- 

Lofty 

69 

62 

7 

ing 

Conclusion 

BO 

73 

7 

Number  of  Students 

90 

31 

,  :  .  .  ;  . 
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his  own  conclusions  on  this  point. 

The  findings  of  this  chapter  support  Kells’ 
conclusion  that  vocabulary  items  discriminate  more 
irregularly  between  social  status  groups  than  do 
geometric-design  items.  These  results  confirm 
the  view  that  if  some  items  are  unsuitable  for 
estimating  the  intellectual  ability  of  lower  class 
children,  they  are  more  likely  to  be  of  a  verbal 


nature . 
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CHAPTER  XII 


ADDITIONAL  FINDINGS 

Previous  chapters  have  outlined  four  different 
attempts  to  evaluate  culture-reduced  intelligence  tests. 
As  Cronbach  (I960)  points  out,  construct  validation  can 
be  approached  by  a  variety  of  methods.  For  intelli¬ 
gence  tests,  an  obvious  criterion  is  that  of  evaluation 
against  school  attainment  in  concurrent  and  predictive 
situations • 

It  has  already  been  pointed  out  that  "cold-blooded 
predictive  validity  is  a  less  suitable  evaluation  stand¬ 
ard  for  this  investigation  since  the  criteria  to  which 
prediction  is  made  are  culturally  weighted.  Only  when 
special  remedial  treatment  has  been  carried  out  or 
cultural  differences  held  constant  would  predictive  or 
concurrent  validity  in  terms  of  school  marks  be  crucial. 
However,  the  effects  of  cultural  weighting  are  not  so 
great  that  a  culture-reduced  test  is  relieved  of  the 
necessity  for  showing  a  reasonably  high  relationship 
with  school  marks,  which  presumably  do  reflect  some 
measure  of  intellectual  development.  Therefore,  re¬ 
lationships  with  average  school  marks  and  standardized 
achievement  tests  will  now  be  examined. 
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Relationship  with  Average  School  Marks .  Shortly 
after  the  Grade  VII  testing  program  was  completed,  end- 
of-year  examinations  were  administered  by  the  school 
authorities.  These  completed,  the  aggregate  school 
marks  for  the  year’s  work  were  collected  by  the  inves¬ 
tigator  for  comparison  with  intelligence  test  data. 

These  marks  were  based  on  examinations  which  were  con¬ 
structed  by  teachers.  Since  the  examinations  were  not 
identical  for  all  classes  in  one  of  the  schools,  and 
since  marks  based  on  term  assignments  were  also  inclu¬ 
ded  in  the  totals,  it  was  decided  to  calculate 
correlations  for  each  class  separately,  and  to  use  the 
median  correlations  with  intelligence  tests  as  a  typical 
index  of  the  relationship  between  each  measure  of 
intelligence  and  scholastic  achievement  as  defined  by 
the  teachers. 

The  average  school  mark  recorded  on  each  pupil’s 
report  card  was  taken  as  a  representative  indication  of 
each  pupil’s  academic  status  in  his  class.  This  average 
was  in  percentage  form  and  was  based  on  equal  weights 
for  all  the  following  subjects;  reading,  literature, 
language,  social  studies,  science,  mathematics,  art, 
music,  health  and  personal  development,  and  industrial 
arts  or  home  economics.  Thus,  it  could  be  regarded  as 
a  composite  index  of  each  pupil’s  achievement  on  the 
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basis  of  judgments  made  by  teachers  in  ten  subjects 
over  a  whole  school  year. 

The  median  correlation  between  this  average 
school  mark  and  each  intelligence  test  was  calculated 
for  the  thirteen  classes  included  in  the  main  sample, 
and  the  results  recorded  in  Table  XXIV. 

From  these  results  it  can  be  seen  that,  on  the 
whole,  the  validity  of  the  culture-loaded  CTMM  and 
Laycock  tests  is  better  than  that  of  the  culture- 
reduced  tests.  Such  a  result  is  to  be  expected  when 
it  is  considered  that  verbal  intelligence  tests  are 
often  validated  against  criteria  of  school  achievement. 
However,  the  margin  of  difference  is  quite  small  in  the 
case  of  the  Progressive  Matrices  and  Holzinger-Crowder 
Series  Tests.  The  results  for  the  Cattell  and  Lorge- 
Thorndike  tests  are  less  impressive,  but  it  should  be 
remembered  that  the  criteria  they  are  predicting  to 
are  heavily  weighted  culturally.  It  was  noticed  that 
correlations  between  average  school  marks  and  socio¬ 
economic  status  were  over  .5  in  five  instances. 

Therefore,  the  predictive  power  of  the  culture- 
reduced  tests  is  lower  than  that  of  the  verbal  tests, 
but  the  difference  is  not  great  enough  to  discourage 
their  use  with  children  who  are  handicapped  verbally, 
particularly  if  measures  to  reduce  the  handicap  are 
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TABLE  XXIV 


CORRELATION  COEFFICIENTS  BETWEEN  END-OF-YEAR 
AVERAGE  SCHOOL  MARK  AND  SELECTED 
INTELLIGENCE  TESTS 


School  Class  No 

*  1 

(Decimal  points  omitted) 

23456739  10 

11 

12 

13 

Median 

N 

20 

14 

23 

13 

9 

19 

27 

23 

25 

23 

23 

21 

20 

271 

Raven  Matrices 

53 

31 

60 

52 

56 

66 

37 

07 

59 

57 

43 

19 

33 

56 

Cattell  g 

23 

23 

55 

25 

50 

63 

35 

34 

23 

50 

53 

03 

33 

34 

Lorge-Thorn. 
Fig.  Cl. 

34 

60 

40 

09 

07 

61 

26 

14 

50 

31 

10 

-11 

33 

31 

Lorge -Thorn. 
Series 

51 

61 

63 

35 

39 

63 

45 

61 

23 

66 

34 

06 

13 

45 

Lorge-Thorn. 

Fig.  Anal. 

42 

52 

49 

27 

51 

42 

32 

07 

55 

45 

47 

-13 

15 

42 

Lorge-Thorn. 

Total 

51 

64 

59 

33 

43 

71 

37 

29 

55 

64 

42 

-06 

32 

43 

Holz . -Crowder 
Series 

30 

65 

44 

53 

62 

71 

12 

15 

23 

32 

14 

-25 

-02 

51 

Holz . -CrowdBF 
Fig. Changes 

37 

53 

32 

32 

90 

59 

51 

51 

60 

31 

33 

10 

37 

40 

CTMM  Non -Lang. 

55 

75 

40 

35 

67 

69 

31 

07 

33 

29 

57 

20 

31 

33 

CTMM  Total 

66 

91 

73 

72 

53 

76 

59 

33 

59 

59 

74 

71 

66 

66 

Laycock  Total 

72 

36 

67 

59 

60 

66 

51 

39 

55 

53 

64 

43 

40 

59 

CTMM  Language 

54  85 

79 

72 

44 

73 

67 

41 

53 

64  69 

70 

67 

67 
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Relationships  with  Standa rdized  Achievement  Tests . 
The  California  Achievement  tests,  which  were  administered 
twelve  months  before  the  Grade  VII  battery,  were  also 
used  to  illustrate  the  relationship  between  the  various 
intelligence  tests  and  school  achievement.  For  the  sample 
of  two  hundred  seventy-one  pupils,  total  scores  on  read¬ 
ing,  arithmetic,  and  English  were  correlated  with  the 
nine  culture-reduced  tests  and  sub-tests.  The  resulting 
coefficients  and  the  means  for  all  intelligence  teste  are 
recorded  in  Table  XXV.  The  means  were  calculated  by 
conversion  to  Fisher’s  z  function.  Reading  total  is  based 
on  Vocabulary  and  Comprehension  sub-tests,  Arithmetic 
Total  on  Reasoning  and  Fundamentals  sub-tests  and  English 
Total  on  Spelling  and  Mechanics  of  English. 

It  is  noticeable  that  the  Laycock  and  CTMM  Total 
scores  show  a  much  higher  relationship  with  Reading  and 
English  than  do  the  culture-reduced  tests.  Since  these 
two  subject  totals  show  correlations  of  .36  and  .27 
respectively  with  socio-economic  status,  the  result  is  not 
unexpected.  In  arithmetic,  the  superior  predictive  power 
of  the  verbal  tests  is  less  marked. 

Of  the  nine  culture-reduced  tests,  the  Holzinger- 
Crowder  Series  Test,  the  Lorge-Thorndike  Total  and  Series 
Sub-test,  and  the  Progressive  Matrices  all  show  mean  r’s 
above  .4.  Since  the  first  three  have  numerical  loadings 
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TABLE  XXV 

PEARSON  CORRELATIONS  BETWEEN  INTELLIGENCE  TESTS 
AND  THREE  STANDARDIZED  ACHIEVEMENT  TESTS 


ADMINISTERED 

TWELVE 

MONTHS  . 

APART 

Test 

Reading 

Total 

Arith 

Total 

English 

Total 

Mean 

r 

Raven  Matrices 

.43 

•  44 

.33 

o41 

Cattell  g 

.36 

.37 

.30 

.35 

Lorge-Thorndike  Fig. Class 

.23 

.37 

.23 

.31 

Lorge-Thorndike  Series 

.41 

.43 

.34 

.41 

Lorge-Thorndike  Fig. Anal. 

.41 

.41 

.34 

.39 

Lorge-Thorndike  Total 

.47 

.53 

.41 

.47 

Holz-Crowder  Series 

.49 

.50 

.47 

.49 

Holz-Crowder  Fig.  Class 

.42 

.39 

.36 

.39 

CTMM  Non- Language 

.35 

.43 

.35 

.33 

C  TMM  Total 

.72 

.60 

.61 

.65 

Laycock  Total 

.71 

.55 

.64 

.64 

CTMM  Language 

.77 

.56 

.62 

.66 

164 

and  show  higher  correlations  with  socio-economic  status, 
the  remaining  test,  the  Progressive  Matrices,  must  be  con¬ 
sidered  amongst  the  best  of  the  testsexamined  for  the 
purposes  of  this  investigation.  Again,  the  Cattell  and 
Lorge-Thorndike  Figure  Analogies  tests  show  lower  correl¬ 
ations  with  each  achievement  test,  but  their  inferiority 
to  the  Progressive  Matrices  in  this  respect  is  not  signi¬ 
ficant.  In  fact,  Table  VII  indicates  that  the  Figure  Anal¬ 
ogies  test  shows  slightly  higher  correlations  with  three 
sub-tests  of  the  California  Achievement  Battery  than  does 
the  Progressive  Matrices. 

Long  Term  Prediction.  Another  relevant  finding  was 
seen  in  the  long  term  predictive  validity  of  the  Coloured 
Progressive  Matrices.  For  the  three  schools  in  the  study, 
the  correlation  coefficients  between  the  Matrices  given  at 
Grade  III  and  the  average  school  mark  described  above  was 
calculated.  These  coefficients  and  their  medians  are 
recorded  in  Table  XXVI. 

These  results  should  be  interpreted  with  caution, 
since  the  examinations  given  in  Grade  VII  were  not  ident¬ 
ical  for  each  class  in  School  C.  Nevertheless,  the  assum¬ 
ption  that  they  could  be  considered  similar  enough  to  make 
comparisons  meaningful  is  acceptable  to  the  teachers  and 
parents  of  the  children.  Certainly  the  assumption  was  less 
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TABLE  XXVI 

PEARSON  CORRELATION  COEFFICIENTS  BE 'TWEEN  END-OF-YEAR 
SCHOOL  AVERAGE  MARK  IN  GRADE  VII  AND  SCORES  ON 
INTELLIGENCE  TESTS  ADMINISTERED 
FOUR  YEARS  PREVIOUSLY 


School 

A 

B 

C 

Median 

r 

N 

67 

105 

71 

243 

Raven  Matrices 

.33 

.35 

.35 

.35 

CTMM  Language 

.31 

,48 

.23 

.31 

CTMM  Non -Language 

.32 

.23 

.15 

.23 

CTMM  Total 

.39 

.43 

.25 

.39 

CTMM  Verbal 

.20 

.19 

.11 

.19 
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questionable  for  Schools  A  and  B,  where  the  same 
examinations  were  written  by  all  children „ 

Although  all  these  coefficients  are  low  they  are 
in  line  with  the  assertion  that  culture-reduced  intel¬ 
ligence  tests  may  not  be  inferior  to  culture-loaded 
tests  when  prediction  is  to  a  distant  and  changing 
criterion.  Thus,  the  Matrices  test  showed  a  higher 
coefficient  than  the  CTMM  Language  and  Verbal  scores, 
and  was  neearly  as  high  as  the  Total  Score.  The  latter 
tests  were  slightly  better  predictors  at  the  Grade  VII 
level,  but  requirements  of  school  success  had  probably 
changed  enough  between  Grades  III  and  VII  for  these 
culture-loaded  tests  to  lose  much  of  their  advantage 
in  long-term  prediction. 

Stability  of  Scores .  Another  finding  worthy  of 
record  is  that  of  the  comparative  stability  of  the  CTMM 
and  Progressive  Matrices  scores  between  Grades  III  and 
VII.  The  correlations  between  the  corresponding  pairs 
of  tests  are  set  out  in  Table  XXVII. 

Although  the  superiority  of  the  Matrices  to  the 
CTMM  Total  and  Language  scores  is  not  significant,  the 
coefficients  presented  indicate  that  an  intelligence 
quotient  based  on  the  Progressive  Matrices  would  be  at 
least  as  stable,  if  not  more  so,  than  an  intelligence 
quotient  obtained  from  the  CTMM. 
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TABLE  XXVII 

PEARSON  CORRELATION  COEFFICIENTS  BETWEEN 
CORRESPONDING  INTELLIGENCE 
TESTS  ADMINISTERED  IN 


GRADES 

III  AND 

VII 

(N  =  243) 

Raven  Matrices 

.55 

CTMM 

Total 

.47 

CTMM  Language 

.49 

CTMM 

Verbal 

.22 

CTMM  Non-Language 

.33 
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Summary.  The  following  findings  have  been 
presented  in  this  chapter. 

1.  Verbal  intelligence  tests  show  only  slight 
superiority  to  the  Progressive  Matrices  and  Lorge- 
Thorndike  Tests  in  their  ability  to  predict  to  average 
end-of-year  school  marks  at  the  Grade  VII  level. 

2.  At  the  Grade  VII  level,  verbal  intelligence 
tests  show  a  significantly  higher  relationship  with 
Grade  VI  standardized  achievement  tests  in  reading, 
arithmetic,  and  English.  Amongst  the  culture-reduced 
tests,  the  Holzinger-Crowder  Series,  the  Lorge-Thorn- 
dike  Series  and  Total  scores,  and  the  Progressive 
Matrices  showed  the  highest  relationships  with  these 
measures  of  achievement. 

3.  The  Coloured  Progressive  Matrices  and  the 
CTMM  Primary  both  administered  in  Grade  III,  showed 
equal  ability  to  predict  to  Grade  VII  end-of-year 
average  school  marks. 

4.  The  correlations  between  appropriate  forms 
of  the  Progressive  Matrices  administered  four  years 
apart  were  at  least  as  high  as  the  corresponding  cor¬ 
relations  for  the  CTMM  test. 
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CHAPTER  XIII 


EVALUATION  OF  THE  CULTURE-REDUCED  TESTS 

The  findings  of  the  previous  five  chapters  must 
now  be  drawn  together  in  order  to  evaluate  the  culture- 
reduced  tests  studied  in  this  investigation. 

Criteria  of  Evaluation.  In  Postulate  10, 

Chapter  V,  seven  criteria  for  evaluating  culture- 
reduced  intelligence  tests  were  listed.  Each  criterion 
enables  more  light  to  be  thrown  on  the  construct  val¬ 
idity  of  each  test.  However,  it  is  apparent  that  some 
test  characteristics  are  more  important  than  others. 
Thus,  a  test  which  satisfies  all  criteria  except  high 
g  loading  would  be  of  little  value  as  a  test  of  gen¬ 
eral  intelligence. 

In  view  of  the  aims  of  the  investigator,  the  two 
most  crucial  standards  of  evaluation  would  appear  to  be 
high  g  loading  and  low  correlation  with  socio-economic 
status,  the  former  to  indicate  sampling  of  an  important 
aspect  of  intelligence,  the  latter  to  indicate  lack  of 
cultural  loading. 

The  remaining  criteria,  although  of  less  impor¬ 
tance,  are  likely  to  support  the  two  major  standards, 
and  should  be  used  in  the  final  evaluation  of  each  test. 
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Lack  of  group  or  environmental  factors,  such  as  verbal, 
numerical  and  educational  dimensions,  is  considered 
important,  since  these  factors  appear  more  sensitive 
to  environmental  change.  Test  materials  should  be 
equally  unfamiliar  to  all  testees,  and  hence  depend  as 
little  as  possible  on  acquired  information.  Medium 
correlations  with  school  achievement  are  considered 
necessary,  if  the  tests  are  to  deserve  the  name  intel¬ 
ligence.  However,  less  weight  can  be  assigned  to  the 
latter  criterion,  since  "cold-blooded"  prediction  to 
culturally  weighted  criteria  is  not  in  line  with  the 
objectives  of  this  study.  The  criteria  of  change  in 
relationship  with  socio-economic  status  over  four 
years,  and  of  variability  in  item  discrimination 
indices  calculated  across  socio-economic  levels  have 
been  used  to  evaluate  only  the  Progressive  Matrices. 

Table  XXVIII  lists  the  nine  culture-reduced  tests 
examined  in  this  investigation  and  shows  the  extent  to 
which  each  meets  the  first  six  criteria  listed  above. 

It  should  be  pointed  out,  however,  that  some  caution 
is  necessary  in  the  interpretation  of  this  Table.  As 
suggested  above,  equal  weight  cannot  be  assigned  to 
all  columns.  Moreover,  a  loading  on  the  group  factor 
of  speed,  in  the  second  column,  would  be  less  serious 
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for  a  test  than  one  on  a  verbal  or  numerical  factor. 
Presumably,  different  test  administration  conditions 
could  well  reduce  the  speed  loading,  but  a  verbal  di¬ 
mension  could  not  be  so  easily  diminished. 

If  most  weight  is  accorded  the  criteria  of  high 
g  loading  and  low  correlation  with  social  status,  while 
the  last  two  columns,  denoting  relationships  with 
present  school  achievement,  are  played  down,  then  the 
Table  can  serve  a  useful  purpose  in  presenting  many  of 
the  relevant  findings  of  this  investigation  in  one 
place,  and  in  facilitating  the  evaluation  of  each  test. 

It  can  be  seen  that  four  of  the  nine  tests  showed 
g  loadings  over  .70  and  all  were  above  .45.  The  Prog¬ 
ressive  Matrices,  Cattell  Test  of  g,  the  Lorge-Thorndike 
Figure  Analogies  and  Total  scores  can  be  interpreted  as 
the  best  indicators  of  the  general  factor  common  to  the 
battery  used.  The  latter  test,  along  with  four  others, 
showed  a  numerical  loading  which  confirmed  the  belief 
that  they  depend  to  some  extent  upon  a  dimension  which 
is  affected  more  by  past  learnings  than  the  other 
instruments  studied.  The  speed  factor,  as  mentioned 
above,  should  not  count  heavily  against  the  tests 
showing  it. 

All  the  tests  except  the  Holzinger-Crowder 
Series  revealed  significantly  lower  correlations  with 
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SUMMARY  OF  NINE  CULTURE-REDUCED  TESTS 
MEASURED  AGAINST  SIX  CRITERIA 
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Socio-economic  status  than  did  conventional  verbal 
intelligencetests ,  some  to  a  greater  degree  than  others. 
The  Lorge -Thorndike  Total  and  Figure  Analogies  tests 
showed  one  less  significant  difference,  while  the  CTMM 
Non-Language  and  the  first  two  sub-tests  of  the  Lorge- 
Thorndike  test  showed  one  more  significant  difference 
than  the  remainder. 

Typical  correlations  with  average  school  marks 
and  standardized  achievement  tests  range  from  .31  to 
.56.  The  best  predictors  in  this  category  appear  to 
be  the  less  culture-reduced  Holzinger-Crowder  Series 
Test  and  the  more  promising  Progressive  Matrices. 

The  Lorge-Thorndike  Total  and  Series  scores  are  also 
relatively  high,  but  the  Cattell  test  seems  less  use¬ 
ful  in  a  Tfcold-blooded,T  prediction  situation. 

Appraisal  of  Bach  Test.  Each  culture-reduced 
test  will  now  be  evaluated  separately. 

1.  Standard  Progressive  Matrices .  As  a  culture- 
reduced  measure  of  intelligence,  this  test  appears  to 
have  more  promise  than  any  other  in  the  battery.  For 
the  sample  tested  it  showed  one  of  the  highest  g  load¬ 
ings,  a  finding  which  is  supported  by  many  other 
studies  (Burke,  1953).  It  is  one  of  the  four  tests  to 
show  no  group  factor  loadings  over  .2  and  yet  it  shows 
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relatively  high  correlations  with  school  achievement. 
The  low  correlation  (.23)  with  socio-economic  status 
is  in  line  with  NisbetTs  (1953)  finding  that  it  is 
relatively  independent  of  the  environmental  influences 
associated  with  family  size.  The  two  Matrices  tests 
showed  no  change  in  relationship  with  socio-economic 
status  over  four  years,  a  finding  which  supports  their 
freedom  from  cultural  loading.  Furthermore,  the 
ability  of  the  Coloured  Progressive  Matrices  to  pre¬ 
dict  Grade  VII  average  school  achievement  from  a 
Grade  III  test  was  as  good  as  that  of  a  verbal  intel¬ 
ligence  test. 

The  item  discrimination  indices  of  the  Stand¬ 
ard  Matrices  computed  across  socio-economic  levels 
were  relatively  stable,  another  finding  indicative  of 
its  comparative  culture-fairness.  Finally,  the  test 
has  good  face  validity,  is  intrinsically  interesting, 
requires  few  initial  instructions  and  generally  shows 
reliability  coefficients  above  .85. 

In  view  of  these  considerations  the  Progressive 
Matrices  must  be  regarded  as  a  potentially  useful 
culture-reduced  intelligence  test,  quite  suitable  for 
estimating  the  intellectual  ability  of  Grade  VII  chil¬ 
dren  from  low  socio-economic  levels  in  an  urban  setting 
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such  as  that  of  Edmonton.  Whether  it  would  stand  up 
to  urban-rural  and  inter-racial  comparisons  equally 
well  is  a  matter  for  further  study.  The  findings  of 
Rimoldi  (194$)  on  Argentine  children,  and  of  Nisbet 
(1953)  on  environmental  influences  suggests  that  it 
may  well  do  this. 

2.  Cattell  Test  of  g  Culture-Free ,  Scale  2. 

This  test  appears  to  conform  well  to  the  requirements 
of  its  constructor  by  virtue  of  its  high  g  loading  and 
its  relative  freedom  from  cultural  influences.  The 
loadings  on  the  general  factor  are  the  highest  in  each 
battery  analyzed,  and  the  correlation  of  the  test  with 
socio-economic  status  is  significantly  lower  than  that 
of  the  verbal  tests  used  for  comparison.  Face  validity 
is  good,  but  some  administration  problems  were  exper¬ 
ienced  in  making  pupils  understand  what  was  required 
in  the  fourth  sub-test.  Re-organization  of  instruc¬ 
tions  and  item  analyses  may  well  improve  this  test, 
particularly  in  its  relation  to  school  achievement, 
which  is  not  as  close  as  that  shown  by  most  of  the 
other  tests.  Furthermore,  the  addition  of  Form  2 
would  probably  increase  both  the  reliability  and 
validity  of  this  rather  short  twenty-minute  test. 

The  indications  derived  from  this  investigation  are 
promising  enough  to  suggest  further  use  and  study. 
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3 .  Lorge-Thorndike  Figure  Classification 
Sub-Test.  ihis  is  one  of  the  least  valuable  of  the 
tests  examined.  Although  its  freedom  from  cultural 
bias  is  su  ;gested  by  its  factor  pattern  and  low  cor¬ 
relation  with  social  status,  the  g  loading  of  .5$  is 
not  very impressive ,  and  the  correlations  of  .31  with 
school  achievement  even  less  so.  In  fact,  examina¬ 
tion  of  the  first  and  last  two  columns  of  Table  XXVIII 
suggests  that  it  probably  adds  little  to  the  value  of 
the  Lorge-Thorndike  Total  score.  The  low  factor  load¬ 
ings  and  communality  are  in  line  with  Rosner’ s  findings 
in  a  factor  analysis  of  the  Level  3  tests  reported  in 
the  Lorge-Thorndike  Manual  (1937). 

4.  Lorge-Thorndike  Number  Series  Sub-Test.  This 
test  is  only  slightly  more  valuable  for  the  purposes  of 
this  study.  The  dependence  on  familiarity  with  numbers 
showed  up  in  the  factor  analysis  as  a  moderately  high 
numerical  loading.  However,  the  low  correlation  with 
socio-economic  status  suggests  that  this  numerical 
dimension  may  not  be  a  serious  disadvantage  in  relation 
to  culture  as  defined  in  this  study.  For  children  with 
little  experience  with  numbers,  it  may  prove  more  of  a 
handicap. 

The  chief  weakness  of  this  test  was  seen  in  its 
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low  g  loading  (.55),  while  its  relationship  with 
school  marks,  despite  the  numerical  factor,  was  not 
as  high  as  that  of  the  Progressive  Matrices.  A  test 
designed  on  similar  lines,  but  using  less  familiar 
symbols,  could  possibly  find  a  place  in  a  composite 
culture-reduced  intelligence  testo 

5.  Lorge-  i'hornd  ike  Figure  Analogies  Sub-Test . 
This  must  be  considered  amongst  the  three  best  tests 
studied  in  this  investigation.  It  showed  a  g  loading 
of  .74,  no  group  factor  loadings,  and  a  sufficiently 
low  correlation  with  socio-economic  status  (.26)  to 
suggest  freedom  from  cultural  influence.  It  has  good 
face  validity,  takes  only  nine  minutes  to  administer, 
and  yet  it  predicts  school  marks  better  than  the 
Cattell  test,  and  nearly  as  well  as  the  Progressive 
Matrices.  If  the  number  of  items  (thirty)  was  in¬ 
creased,  or  if  they  were  included  in  a  composite  test, 
it  would  surely  improve  both  reliability  and  validity. 
The  high  g  loading  is  supported  by  the  corresponding 
figure  of  .80  cited  in  the  Lorge-Thorndike  Manual 
(1957)  for  the  Level  3  test.  Apparently,  the  figure 
analogies  item  is  worthy  of  intensive  study. 

6.  Lorge- ihorndike  Total9  As  a  whole,  this  test 
conformed  relatively  well  to  the  criteria  of  this  study. 
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It  showed  a  g  loading  of  ,75  and  was  amongst  the  best 
predictors  of  school  achievement.  The  correlation  with 
socio-economic  status  was  significantly  lower  than 
that  of  the  verbal  tests,  but  the  numerical  test  added 
a  group  factor  loading  which  suggests  more  dependence 
on  environmental  acquisitions  than  is  desirable  for 
culture-reduction  purposes. 

Since  the  g  loading  was  no  better  than  that  of 
the  third  sub-test  and  its  correlations  with  school 
marks  similar  to  those  of  the  second  and  third  sub¬ 
tests,  it  is  possible  that  the  omission  of  the  first 
sub-test  would  not  harm  this  test.  In  fact,  the  cor¬ 
relations  of  the  third  sub-test  with  the  total  score 
was  o$6,  which  was  high  enough  to  suggest  that  much 
of  the  merit  that  the  total  test  shows  could  well  be 
ascribed  to  the  promising  figure  analogies  items. 
Verification  of  this  hypothesis  could  come  only  from 
further  item  analyses. 

7.  Holzinger-Crowder  Mixed  Series.  This  test 
was  probably  the  least  culture-reduced  of  the  nine 
examined.  It  showed  the  highest  correlations  with 
school  marks,  but  conformed  to  no  other  criteria  of 
evaluation.  Its  g  loading  was  relatively  low  (.46), 
its  correlation  with  socio-economic  status  was  not 
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significantly  different  from  those  of  the  verbal  tests, 
and  it  showed  loadings  on  three  group  factors.  Its 
emphasis  on  familiarity  with  numbers  and  letters  of 
the  alphabet  gives  it  a  dependence  on  past  learning 
which  is  probably  greater  than  that  of  any  of  the 
nine  tests.  Evidently,  the  mixed  series  item  is  not 
suitable  for  the  purposes  of  this  study. 

8.  Holzinger-Crowder  Figure  Changes  Test .  This 
test  was  shown  to  be  considerably  less  culture-loaded 
than  the  last.  Its  correlation  with  socio-economic 
status  was  promisingly  low  (.22)  and  its  group  factor 
loadings  were  not  seen  as  detrimental.  However,  its 

g  loading  was  unimpressive  (.52)  and  its  correlation 
with  school  marks  lower  than  could  be  desired.  Face 
validity  is  good,  and  the  similarity  of  the  items  to 
those  of  the  Lorge-Thorndike  Figure  Analogies  suggests 
that  the  test  might  have  shown  more  promise  if  the 
speed  element  had  not  been  so  pronounced.  The  test 
was  designed  for  Grades  VII  to  XII,  and  may  well 
prove  more  suitable  for  older  children  than  those 
used  in  this  study. 

9.  CTI’-'M  Non-Language  Total.  This  score  Inclu¬ 
des  two  short  spatial  tests,  a  logical  classification 
test  and  a  ten  item  number  series  test.  The  last  two 
seem  to  depend  on  past  learning,  but  the  correlation 
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with  socio-economic  status  is  low  enough  (.13)  to  sug 
gest  no  cultural  loading,  as  defined  in  this  study. 
The  g  saturation  of  this  test  ranked  fifth  (.62)  in 
the  eleven  test  analysis,  but  the  correlations  with 
school  marks  were  amongst  the  lowest.  This  could  be 
attributed,  in  part,  to  the  influence  of  the  spatial 
scores,  which  usually  show  little  relationship  with 
school  achievement,  and  which  in  this  study  appeared 
to  have  a  very  low  ceiling.  If  this  test  has  further 
merit  as  a  culture-reduced  measure  of  intelligence, 
it  was  not  very  apparent  in  the  results  of  this  inves 
tigation.  It  was  designed  for  use  with  Grades  IV  to 
VIII,  but  would  appear  to  be  more  suitable  for  elemen 
tary  school  children. 

Summary,,  Nine  culture-reduced  intelligence 
tests  have  now  been  evaluated  against  a  number  of 
relevant  criteria.  On  the  basis  of  the  findings  re¬ 
ported,  the  Progressive  Matrices  Test  shows  the  great 
est  potential,  since  it  meets  all  the  criteria  of 
evaluation.  The  Cattell  Test  of  g  and  the  Lorge- 
Thorndike  Figure  Analogies  Tests  must  be  given  com¬ 
parable  honours,  since  they  fulfilled  nearly  all  the 
requirements  in  less  than  half  the  testing  time.  As 
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they  stand,  each  of  these  three  measures  can  be 
regarded  as  a  useful  test  for  the  purpose  of  this 
study.  However,  a  combination  of  the  merits  of  each 
test  in  one  composite  could  well  increase  the  separ¬ 
ate  reliability  and  validity  of  each  instrument  to 
the  point  where  a  potentially  valuable  estimate  of 
intellectual  ability  was  made  possible. 

Of  the  remaining  six  tests,  the  Holzinger- 
Crowder  Figure  Changes  test  appears  promising  for 
older  children,  while  the  CTMM  Non-Language  Total 
may  improve  its  standing  on  younger  pupils.  The 
other  tests  must  be  described  as  less  useful  for 
the  purposes  of  this  investigation,, 
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CHAPTER  XIV 


SUMMARY, CONCLUSIONS,  AND  IMPLICATIONS 

This  investigation  was  designed  to  investigate 
the  extent  of  socio-economic  bias  in  selected  intel¬ 
ligence  tests  with  a  view  to  identifying  tests  which 
would  estimate  general  intellectual  ability  with  a 
minimum  of  such  bias0 

Intelligence  was  regarded  statistically  as  a 
hierarchical  construct  inferred  from  an  analysis  of 
test  scores.  Loadings  on  the  broad  general  intellec¬ 
tual  factor,  g,  which  commonly  accounts  for  a  large 
proportion  of  ability  and  achievement  test  variance, 
were  taken  as  a  major  criterion  of  the  construct 
validity  of  the  tests  examined.  The  extent  of  cul¬ 
tural  loading  was  determined  by  such  criteria  as 
relationship  with  socio-economic  status  at  different 
age  levels,  dependence  on  acquired  knowledge,  and 
patterns  of  item  discrimination  across  socio-economic 
levels. 

Socio-economic  status  was  defined  operationally 
as  the  variable  measured  by  a  Canadian  occupational 
scale  derived  by  Blishen  (195$)  from  census  data,  and 
a  Home  Index  Scale  of  twenty  items  adapted  from; 
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a  similar  measure  constructed  by  Gough  (1949).  Results 
from  these  two  scales  were  combined  with  equal  weights 
and  a  useful  measure  of  socio-economic  status  produced,, 

Nine  promising  culture-reduced  intelligence 
tests  and  sub-tests  were  administered  to  a  represent¬ 
ative  Edmonton  sample  of  two  hundred  seventy-one  Grade  VII 
children,  and  the  results  analyzed  in  relation  to  socio¬ 
economic  status,  verbal  intelligence,  achievement,  and 
school  marks.  City-wide  testing  surveys  conducted 
during  the  previous  four  years  provided  data  on  the 
same  children,  This  made  longitudinal  comparisons 
possible.  Eight  criteria  for  evaluating  the  tests 
in  terms  of  "culture-reduction”  and  estimation  of  gen¬ 
eral  intelligence  were  set  up,  and  hypotheses  about 
the  ability  of  each  test  to  meet  such  criteria  were 
formulated. 

The  main  findings  and  conclusions  of  the  inves¬ 
tigation  are  listed  below0 

Results  of  Analysis . 

1.  Principal  components  factor  analysis  of  all 
variables  used  in  the  Grades  VI  and  VII  testing  pro¬ 
grams  revealed  five  factors.  A  large  general  factor 
(g)  accounted  for  over  fifty  per  cent  of  the  common 
variance,  while  narrower  group  and  environmental  fac¬ 
tors  were  interpreted  as  verbal-educational  (v:ed), 
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numerical  (n),  socio-economic  status  (w)  and  speed 
(sp) . 

2.  High  g  loadings  accompanied  by  negligible 
group  factor  loadings  were  shown  by  three  culture- 
reduced  tests:  the  Progressive  Matrices,  the  Cattell 
Test  of  g,  and  the  Lorge-Thorndike  Figure  Analogies 
sub-test.  The  remainder  of  the  variables  analyzed 
showed  g  saturations  below  .7  or  loadings  on  less 
general  ability  and  environmental  factors. 

3.  Two  typical  verbal  intelligence  tests,  the 
California  Test  of  Mental  Maturity  and  the  Laycock 
Mental  Ability  Test,  differentiated  between  socio¬ 
economic  levels  significantl}/-  more  than  did  eight  of 
the  culture-reduced  tests  selected  for  this  study. 

This  finding  is  consistent  with  the  contention  of  Sells 
(1951)  and  others  that  verbal  intelligence  tests  are 
culturally  weighted  in  favour  of  children  from  higher 
socio-economic  levels. 

A.  The  relationship  between  socio-economic 
status  and  the  abilities  sampled  by  two  forms  of  the 
California  Test  of  Mental  Maturity  increased  signifi¬ 
cantly  in  the  four  years  between  testing  occasions. 

By  contrast,  no  change  was  observed  in  a  correspond¬ 
ing  analysis  of  two  forms  of  the  Progressive  Matrices 


.  . 

. 

'• 


■ 


.  .  .  1  ‘  i 


.  '  „'■■■  .  -  • 

O 

.  .  i ...  ■  ,■  . 

,  . 

-  .TO  ....  ■ 

,  :  ..  .  ...  ,  ..  - 

;  >"j.  ,  .1  .W.j:  '  ..  (t 

O  |  J  ..  t  b.  ,, 

.  .  ......  0  ■  ....  : 


■  -  . 

:  r.j  •  . 


135 


This  conclusion  supports  the  hypothesis  that  verbal 
intelligence  tests  are  more  sensitive  to  differences 
in  cultural  environment  than  are  intelligence  tests 
of  the  Matrices  type. 

5.  Items  from  the  verbal  test  of  the  Calif u 
ornia  Test  of  Mental  Maturity  administered  in  Grade  VII 
discriminated  in  irregular  fashion  between  socio¬ 
economic  levels.  This  finding  suggests  the  unsuit¬ 
ability  of  certain  items  for  estimating  the  intellec¬ 
tual  potential  of  lower  class  children.  By  contrast, 
the  culture-reduced  Progressive  Matrices  showed  sig¬ 
nificantly  less  variable  socio-economic  discrimination 
indices,  a  result  which  is  compatible  with  the  hypothesis 
that  it  is  measuring  aspects  of  ability  less  influenced 
by  cultural  advantage. 

6.  Verbal  intelligence  tests  showed  higher 
correlations  with  current  school  achievement  than  did 
culture-reduced  measures.  Since  measures  of  school 
success  are  culturally  weighted,  and  are  frequently 
used  for  verbal  intelligence  test  validation,  this 
result  is  to  be  expected.  However,  the  correlations 
of  the  majority  of  the  culture-reduced  measures  were 
high  enough  to  encourage  confidence  in  their  use  for 
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children  from  culturally  deprived  environment  partic¬ 
ularly  when  adaptive  treatments  are  possible. 

7.  i'he  Coloured  Progressive  Matrices,  admin¬ 
istered  in  Grade  III,  predicted  average  Grade  VII 
school  marks  as  efficiently  as  did  the  culture-loaded 
California  Test  of  Mental  Maturity.  This  finding  is 
consistent  with  the  contention  that  culture-reduced 
tests  of  the  general  intellectual  ability  factor  are 
as  useful  as  culture-loaded  intelligence  tests  when 
prediction  is  to  non-specific  and  changing  criteria. 

Major  Conclusions . 

1.  On  the  basis  of  the  criteria  used  in  this 
investigation,  the  test  which  has  been  identified  as 
the  most  useful  culture-reduced  measure  of  intelligence 
is  Raven’s  Standard  Progressive  Matrices.  This  test 
satisfied  the  criteria  of 

a.  high  g  loading 

b.  low  correlation  with  socio-economic  status 

c.  absence  of  group  or  environmental  factors 

do  lack  of  dependence  on  acquired  information 

e.  medium  correlation  with  school  achievement 

f.  regular  pattern  of  item  discrimination  across 
socio-economic  levels 
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g.  no  change  in  relationship  with  socio-economic 
status  over  four  years 

h.  ability  to  predict  to  a  non-specific  crit¬ 
erion  of  general  school  achievement  over  a 
period  of  four  years  . 

Furthermore,  the  Matrices  test  is  unspeeded, 
intrinsically  interesting,  easy  to  administer  and 
suitable  for  individual  or  group  use.  Also,  it  con¬ 
sistently  shows  reliability  figures  over. 35  (Burke, 

1953).  This  test  can,  therefore,  be  recommended  for 
estimating  the  general  intellectual  ability  of  chil¬ 
dren  from  low  socio-economic  levels  in  an  urban  set¬ 
ting  such  as  that  of  Edmonton.  Related  studies  (Rimoldi, 
1943;  Nisbet,  1953)  indicate  the  Matrices’  relative 
freedom  from  environmental  influences  other  than  socio¬ 
economic  . 

2.  Two  other  instruments,  the  Cattell  Test  of 
g  and  the  Lorge-Thorndike  Figure  Analogies  Sub-test 
showed  equal  promiseon  the  first  four  criteria,  but 
slightly  less  on  the  fifth.  The  remaining  criteria 
have  not  been  applied  to  these  two  tests.  The  fact 
that  both  measures  required  less  than  half  the  test¬ 
ing  time  of  the  Progressive  Matrices,  further  indicates 
the  potential  value  of  their  approach. 
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The  inter-correlations  of  these  three  tests 
were  low  enough  to  suggest  that  a  composite  of  all 
of  them  would  produce  a  most  useful  estimate  of  gen¬ 
eral  intellectual  ability. 

Implications 

1.  Theoretical.  The  hypotheses  of  this  study 
were  deduced  from  ten  postulates  which  in  turn  were 
derived  from  previous  research.  Since  all  hypotheses 
were  substantiated,  and  several  additional  findings  of 
relevance  produced,  the  implications  for  the  theoret¬ 
ical  rationale  of  the  study  must  now  be  considered. 

The  present  research  findings,  although  restricted 
to  an  urban  sample  at  the  Grade  VII  level,  are  consis¬ 
tent  with  the  following  general  postulates  of  the 
theory  of  intelligence. 

a.  Individual  differences  in  measured  intelli¬ 
gence  on  conventional  tests  are  a  function  of  environ¬ 
mental  influences  and  familiarity  with  the  content  of 
test  items.  A  postulate  of  this  nature  was  originally 
proposed  by  Binet,  but  is  frequently  ignored  by  test 
administrators  and  school  authorities  responsible  for 
the  selection  and  classification  of  children  from 
various  socio-economic  and  cultural  backgrounds. 
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b.  The  effects  of  socio-economic  backgrounds 
are  such  that  children  from  low  socio-economic  status 
in  an  urban  culture  are  handicapped  in  their  ability 
to  perform  on  conventional  intelligence  tests.  It  is 
not  contended  that  intellectual  differences  between 
socio-economic  levels  are  exclusively  attributable 
to  the  lack  of  vocabulary  and  experienced  required 
to  perform  well  on  these  tests.  The  assumption  of 
hereditary  differences  in  intelligence  has  not  been 
discounted  in  this  investigation.  Furthermore,  social 
status  differences  in  the  development  of  Intelligence  B 
(present  functioning  level)  are  conceivably  accounted 
for,  in  some  measure,  by  the  differential  environmental 
stimulation  to  which  the  children  are  subjected. 
Nevertheless,  the  present  study  supports  the  hypothesis 
that  a  significant  amount  of  the  test  score  variance  is 
attributable  to  familiarity  with  the  materials  used  in 
current  intelligence  tests.  Since  children  from  lower 
socio-economic  levels  commonly  have  different  values 
and  experiences  from  children  in  the  upper  levels 
(Havighurst,  1951),  they  do  not  share  the  same  oppor¬ 
tunities  to  develop  the  academic  vocabulary  and  schol¬ 
astic  skills  necessary  for  success  in  verbal  intelli¬ 
gence  tests.  To  that  extent,  such  children  frequently 
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do  not  reveal  their  potential  as  accurately  as  upper- 
class  children. 

c.  Tests  can  be  devised  to  provide  useful  esti¬ 
mates  of  the  general  intellectual  ability  of  pupils 
regardless  of  their  socio-economic  backgrounds.  The 
three  tests  identified  in  this  study  showed  higher  g 
loadings  and  lower  correlations  with  socio-economic 
status  than  conventional  intelligence  tests,  and  medium 
correlations  with  culturally  weighted  school  marks. 

They  therefore  sample  an  important  aspect  of  intellec¬ 
tual  ability,  yet  are  relatively  independent  of  verbal 
symbolism  and  other  acquired  knowledge. 

d.  A  broad  general  factor  of  intellectual 
ability  (g)  has  the  following  characteristics: 

i.  It  can  be  sampled  through  restricted  and 
unfamiliar  media.  This  postulate,  originally  proposed 
by  Spearman  (1923),  is  supported  by  the  factor  analytic 
results  of  four  variables  in  the  present  study. 

ii.  It  is  less  sensitive  to  environmental 
change  than  other  dimensions.  Thus  the  Progressive 
Matrices,  which  is  predominantly  a  test  of  g,  showed 
significantly  less  sensitivity  to  environmental  dif¬ 
ferences  over  four  years  than  did  the  G TMM .  Moreover, 
the  tests  with  high  g  loadings  and  no  group  factor 
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consistently  showed  the  least  relationship  with 
differences  in  socio-economic  status. 

iii.  It  is  a  useful  construct  in  prediction 
to  non-specific  and  changing  criteria. 

Such  a  postulate  was  supported  by  the  predic¬ 
tion  of  Grade  VII  average  school  marks  by  the  g-loaded 
Coloured  Progressive  Matrices  administered  in  Grade 
III.  Firmer  support  is  seen  in  research  on  the  pre¬ 
dictive  validity  of  the  Matrices  tests  reported  in 
Vernon  and  Parry  (1949)  and  Burke  (195$).  When 
prediction  is  made  to  a  non-verbal  criterion  or  when 
a  change  in  treatments  is  possible,  the  usefulness  of 
the  g  factor  appears  to  be  increased.  However,  more 
research  on  this  postulate  is  necessary. 

These  three  implications,  regarded  collectively, 
support  the  continued  usefulness  of  the  g  construct  in 
the  intellectual  assessment  of  children. 

2.  Direct  Application .  The  three  most  promising 
tests  identified  by  this  study  could  profitably  be  put 
to  immediate  use  in  the  estimation  of  intellectual 
ability  in  the  following  kinds  of  situations. 

a.  Identification  of  talented  students  from 
underprivileged  backgrounds. 
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b.  Classification  of  pupils  from  foreign 
language  backgrounds. 

c.  Predicting  to  long-term  and  non-specific 
criteria,  particularly  when  change  of  treatments  is 
possible. 

d.  Supplementing  verbal  intelligence  measures 
in  the  routine  testing  of  children  from  all  socio¬ 
economic  levels o 

e.  Improvement  of  cross-cultural  comparisons. 

f.  Distinction  between  mental  defect  and  read¬ 
ing  difficulties  due  to  environmental  opportunity  or 
problems  of  a  sensory  or  emotional  nature. 

g.  Holding  intelligence  constant  in  research 
experiments  where  verbal  ability  is  irrelevant. 

It  should  be  added  that  each  of  the  three  tests 
identified  requires  few  instructions,  can  be  adminis¬ 
tered  in  a  class  period,  has  intrinsic  appeal  for  chil 
dren  and  is  easy  to  mark.  Consequently,  each  can  be 
administered  with  a  minimum  of  special  training, a  con¬ 
sideration  which  gives  them  a  real  advantage  in  many 
teaching  situations <> 

3.  Further  Research.  The  fact  that  it  was  pos 
sible  to  identify  a  test  which  would  meet  all  the 
required  criteria  is  encouraging  for  future  test  cons- 
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truction.  The  sampling  of  intellectual  ability  with 
minimum  dependence  on  acquired  skills  seems  to  be  a 
more  reasonable  goal,  and  the  possibility  of  construc¬ 
ting  tests  along  the  lines  suggested  by  this  study  ap¬ 
pears  to  warrant  further  study. 

However,  it  should  be  emphasized  that  the  cul¬ 
tural  influences  examined  in  the  present  research 
were  limited  to  variables  associated  with  differences 
in  socio-economic  status  in  an  urban  setting.  Further 
more,  there  is  still  no  experimental  evidence  on  the 
ability  of  the  culture-reduced  intelligence  tests  to 
identify  children  who  would  benefit  most  from  greater 
environmental  stimulation. 

Therefore,  three  lines  of  future  research 
appear  to  follow  logically  from  the  findings  of  the 
present  investigation. 

1.  There  is  a  need  for  the  construction  of  new 
tests  based  on  the  type  of  items  which  show  most  prom¬ 
ise  in  the  three  culture-reduced  tests  identified. 

For  instance,  one  might  predict  considerable  potential 
for  an  unspeeded  test  which  used  the  Progressive  Mat¬ 
rices  approach  of  testing  what  it  taught  and  which 
supplemented  the  abilities  measured  with  the  most  dis¬ 
criminating  figure  analogy  and  perceptual  series  items 
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2.  The  most  promising  culture-reduced  tests  of 
this  study  need  to  be  evaluated  in  relation  to  the 
environmental  influences  associated  with  urban-rural 
differences,  sensory  deficiences  and  cross-cultural  com¬ 
parisons.  The  suitability  of  these  tests  for  Indian 
and  Eskimo  children,  for  example,  should  be  a  problem 
for  early  investigation. 

3.  Experimental  studies  could  be  designed  to 
demonstrate  the  usefulness  of  the  measure  identified, 
perhaps  by  the  administration  of  remedial  treatment  to 
children  who  show  large  discrepancies  between  their 
culture-loaded  and  culture-reduced  intelligence  quotient. 
Further  long-term  prediction  studies  to  changing  and 
non-specific  criteria  could  well  throw  more  light  on 

the  potential  usefulness  of  culture-reduced  tests  of 
intelligence . 

If  it  were  possible  to  explore  these  implica¬ 
tions  to  the  full,  and  to  produce  a  more  valid  and 
reliable  intelligence  test  which  was  less  sensitive 
to  the  influence  of  acquired  knowledge,  vocabulary 
improvement  or  familiarity  with  upper-middle  class 
experiences,  then  the  contribution  of  mental  testing 
to  the  problems  of  education,  psychology,  neurology, 
and  national  welfare  could  be  substantially  increased. 
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APPENDIX  A 


HOME  INDEX  QUESTIONNAIRE 


1.  Name:  _ _ _ _ _ _ _ 

(Last  Name)  (First  Name”) 

2.  Father1 s  Occupation:  _ 

(Be  Clear.  For  Example:  Sales  clerk  at  Eatons, 
door-to-door  salesman  for  Fuller  Brush,  travelling 
salesman  for  Massey-Ferguson) . 

3.  Mother Ts  Occupation:  _ _ 

A 

4.  Do  you  ever  use  a  language  other  than  English  in  your 

home? 


If  you  do,  (a)  Name  it  here:  _ 

(b)  Circle  the  word  which  tells  how  often 
you  use  it: 

Hardly  ever  Quite  often  Most  of  the  time 

•^5.  About  how  long  have  you  lived  in  Canada?  _ _ 


DIRECTIONS:  In  the  following  questions,  mark  your  answer 
by  putting  a  circle  in  the  right  place,  For 
example,  in  the  question  "Does  your  family 
own  a  car?"  draw  a  circle  around  the  Yes  if 
your  family  does  own  a  car,  and  around  the  No 
if  it  does  not.  Be  sure  to  answer  all  the 
questions . 


1. 

Does 

your  family  own  a  car?  .... 

Yes 

No 

2. 

Does 

your  family  have 

a  garage  or  carport  ? 

Yes 

No 

3. 

Did 

your 

father  go 

to 

high  school?. 

Yes 

No 

4. 

Did 

your 

mother  go 

to 

high  school? 

Yes 

No 

5. 

Did 

your 

father  go 

to 

university? 

Yes 

No 

6. 

Did 

your 

mother  go 

to 

university? 

Yes 

No 

7. 

Is  there 

a  writing 

desk  in  your  home? 

Yes 

No 

± 


Questions  designed  for  a  parallel  study. 


•  ■ 

...  - 

.1  ;w?i  I  . 


■v.\.  '..J'.j  iV  .1  '  *■  ...  c-  : 


L  c,:  :  •  ;. .  '«;} 

.:  l  i  -'i  .;  o  ■ :  dr  r  -x •;  y  v  I 

•  •  .r< ", .buv.'J  uov  yv vo i  cl'Su.a^ 


■.  ..  .  '  '■  •  "...  <  -  .  •' '  .  :u.  ’  'oJ.i  o\!  •  ..i';!  r 

- 

.  i  .  .  i  ■  .■  .  ■  ..  •  .  m :o 

'  :  •  '  0*.  J  j 

.  .'.d  :  ... 


.  .  .  :  .  .'  , 

.  .  .  '  (. 

. 

.  'I  '  .‘V  ;  ■  m.  fij'i  t-  i  .V 

•'  -  ‘  i (  :  "  ■\\c  V  Tf.' 


V.  '  '  '  .  .  I 


208 


HOME  INDEX  QUESTIONNAIRE  (continued) 

8.  Does  your  family  have  Hi  -  Fi  or  record  player?  Yes  No 

9.  Does  your  family  have  a  piano?  ....  Yes  No 

10.  Does  your  family  get  a  daily  newspaper?  .  .  Yes  No 

11.  Do  you  have  your  own  room  at  home?  .  .  .  Yes  No 

12.  Does  your  family  own  its  home?  ....  Yes  No 

13.  Is  there  an  encyclopedia  in  your  home?  .  .  Yes  No 

14.  Does  your  family  have  more  than  100  hard-cover 

books?  (e.g.  4  shelves  3  feet  long)  .  .  Yes  No 

15.  Did  your  parents  borrow  any  books  from  the 

library  in  the  last  year? . Yes  No 

16.  Does  your  family  leave  town  each  year  for  a 

holiday? . Yes  No 

17.  Do  you  belong  to  any  club  where  you  have 

to  pay  fees? . Yes  No 

18.  Does  your  mother  belong  to  any  clubs  or 

organizations  such  as  study,  church,  art, 

or  social  clubs? . Yes  No 

19.  Does  your  father  belong  to  any  such  clubs  or 

organizations?  . Yes  No 

20.  Have  you  ever  had  lessons  in  music,  dancing, 

art,  swimming,  etc.,  outside  of  school?  .  Yes  No 


APPENDIX  B 


CANADIAN  OCCUPATIONAL 

CONSTRUCTED  BY  BLISHEN, 

Occupation 

SCALE 

1958 

Scaled  Score 

Judges 

90.0 

Dentists 

£2.5 

Physicians  and  Surgeons 

Si.  2 

Lawyers 

7S.S 

Engineers  -  Chemical 

77.8 

Actuaries 

77.6 

Engineers  -  Mining 

77.4 

Engineers  -  Electrical 

75.2 

Engineers  -  Civil 

75.0 

Architects 

73.2 

Statisticians  (F) 

72.9 

Engineers  -  Mechanical 

72.6 

Professors 

72.0 

Stock  and  Bond  Brokers 

70.9 

Veterinary  Surgeons 

69. s 

Business  Service  Officers 

69.5 

Statisticians  (M) 

6S.S 

Mining  Managers 

67.9 

Finance  Managers 

67.7 

Osteopaths  and  Chiropractors  (M) 

67.3 

Dieticians  (F) 

67.0 

Professors  (F) 

66.7 

Chemists  and  Metallurgists 

65. s 

Officers  -  Armed  Forces 

65.1 

Air  Pilots 

65.0 

Chemists  and  Metallurgists 

64*  s 
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APPENDIX  B  (continued) 


Occupation 

Scaled  Score 

Agriculture  Professionals 

6  4.6 

Electric  -  Gas  and  Water  Officials 

64.7 

Other  Professions 

64.0 

Construction  Managers 

63  .  8 

Wholesale  Trade  Managers 

63.5 

Librarians  (F) 

63.4 

Authors  -  Editors  and  Journalists 

(M) 

63.4 

Manufactoring  Managers 

63.0 

Community  Service  Workers 

62.4 

Social  Welfare  Workers 

62.2 

Osteopaths  and  Chiropractors  (F) 

62.2 

School  Teachers 

62.2 

Librarians  (M) 

62.0 

Accountants  and  Auditors 

61.  8 

Authors  -  Editors  and  Journalists 

(F) 

61.4 

Clergymen 

61.0 

Designers  -  Clothing 

60.6 

Government  Service  Officials 

60.6 

Transportation  Managers 

60.1 

Farmers  (F) 

59.4 

Community  Service  Workers  (F) 

59.1 

Dispatchers  -  Train 

53. 5 

Designers  -  Clothing  (F) 

58.2 

Insurance  Agents 

58.2 

Foremen  -  Communication 

58.1 

Advertising  Agents 

58.0 

Managers  -  N.E.S. 

57.7 

School  Teachers  (F) 

57.6 

Artists  and  Teachers  of  Art 

57.6 
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APPENDIX  B  (continued) 


Occupation 

Scaled  Score 

Nurses  -  Graduate  (F) 

57.4 

Real  Estate  Agents  and  Dealers 

57.0 

Social  Welfare  Workers  (M) 

57.0 

Retail  Trade  Managers 

57.0 

Actors  (F) 

56.9 

Commercial  Travellers 

56.7 

Advertising  Agents  (F) 

56.6 

Forestry  Managers 

56.5 

Artists  -  Commercial  (F) 

56.4 

Radio  Announcers 

56.4 

Laboratory  Technicians  -  N.E.S.  (F) 

56.0 

Artists  -  Commercial 

56.0 

Draughtsmen 

56.0 

Brokers  -  Agents  and  Appraisers 

56.0 

Inspectors  -  Communication 

55.0 

Artists  and  Teachers  of  Art  (F) 

55.0 

Surveyors 

55.0 

Recreation  Service  Officers 

54.8 

Purchasing  Agents 

54.8 

Agents  -  Ticket  Station 

54.3 

Laboratory  Technicians  -  N.E.S. 

54.2 

Stenographers  -  Typists  (F) 

54.1 

Conductors  -  Railway 

54.1 

Radio  Operators 

54.0 

Locomotive  Engineers 

54.0 

Photo  Engravers 

54.0 

Music  Teachers  (M) 

53.7 

Teachers  -  N.E.S. 

53.6 

Office  Appliance  Operators 

53.4 
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APPENDIX  B  (continued) 


Occupation 

Scaled  Score 

Retail  Trade  Managers  (F) 

53.3 

Telegraph  Operators  (F) 

52.9 

Foremen  -  Mining 

52.8 

Window  Decorators 

52.3 

Nurses  -  Graduate  (M) 

52.2 

Actors  (M) 

52.1 

Stenographers  (M) 

52.0 

Bookkeepers  and  Cashiers  (F) 

51.9 

Forewomen  -  Communication 

51.8 

Foremen  -  Manufacturing 

51.8 

Photographers 

51.8 

Inspectors  -  Construction 

51.7 

Window  Decorators  (M) 

51.7 

Telegraph  Operators  (M) 

51.6 

Petroleum  Refiners 

51.6 

Engravers  -  Except  Photo 

51.4 

Undertakers 

51.3 

Office  Clerks  (F) 

51.2 

Locomotive  Firemen 

51.2 

Bookkeepers  and  Cashiers 

51.2 

Brakemen  -  Railway 

51.1 

Power  Station  Operators 

51.0 

Office  Appliance  Operators 

51.0 

Doctors’  and  Dentists’  Attendants 

50.8 

Motion  Picture  Projectionists 

50.8 

Radio  Repairmen 

50.8 

Captains,  Mates,  Pilots 

50.7 

Foremen  -  Transport  and  Storage 

50.7 

Foremen  -  Commercial 

50.6 
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APPENDIX  B  (continued) 


Occupation 

Scaled  Score 

Personal  Service  Officers 

50.5 

Pattern  Makers  (M) 

50.4 

Compositors 

50.4 

Inspectors  -  Metal 

50.4 

Papers  Makers 

50.4 

Photographers  (F) 

50.2 

Policemen 

50.2 

Office  Clerks 

50.2 

Mechanics  -  Aeroplane 

50.1 

Inspectors  -  Metal  Products 

50.0 

Music  Teachers  (F) 

50.0 

Firemen  -  Fire  Department 

49.8 

Pressmen  -  Plate  Printers 

49.8 

Telephone  Operators  (F) 

49 . 6 

Linemen  and  Servicemen 

49.4 

Engineering  Officers  -  On  Ship 

49  •  4 

Baggagemen 

49 . 4 

Transportation  Inspectors 

49.4 

Rolling  Millmen 

49  •  4 

Auctioneers 

49.3 

Inspectors  and  Graders 

49.2 

Farmers 

49.2 

Photographic  Occupations  -  N.E.S. 

49.2 

Collectors 

49.1 

Dental  Mechanics 

49.1 

Sulphite  Cookers 

49.0 

Assemblers  -  Electrical  Equipment  (F) 

48.9 

Operators  -  Electric  Street  Railway 

4S.S 

Stationary  Engineer 

48.7 
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APPENDIX 


B  (continued) 


Occupation  Scaled  S 


Bookbinders 

43. 6 

Tire  and  Tube  Builders  (F) 

43  •  4 

Canvassers  -  Salesmen 

43.2 

Telephone  Operators 

43.2 

Switchmen  and  Signalmen 

43.2 

Opticians 

43.2 

Jewellers  and  Watchmakers 

43.2 

Personal  Service  Workers  (F) 

43.1 

Assemblers  -  Electrical  Equipment 

43.1 

Tire  and  Tube  Builders 

43.1 

Mill  Wrights 

43.0 

Religious  Workers  -  N.E.S. 

43.0 

Fitters  -  Metal  (F) 

47.9 

Milliners 

47.3 

Construction  Foremen 

47.7 

Opticians  (F) 

47.6 

Bus  Drivers 

47.6 

Heat  Treaters 

47.6 

Religious  Workers  -  N.E.S.  (F) 

47.5 

Photographic  Workers  -  N.E.S.  (F) 

47.4 

Machine  Operators  -  Metal  (F) 

47.4 

Boiler  Makers 

47.3 

Jewellers  and  Watchmakers  (F) 

47.2 

Other  bookbinding  Workers  =  N.E.S. 

47.2 

Sales  Clerks 

47.2 

Hoistmen  -  Cranemen 

47.2 

Welders 

47.2 

Mechanics  -  N.E.S. 

47.2 

Mechanics  -  Railroad 

47.2 
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APPENDIX  B  (continued) 

Occupation  Scaled  Score 

Fitters  -  Metal  47.2 

Cutters  -  Textile  Goods  47.2 

Millmen  47.2 

Wire  Drawers  (F)  47.1 

Coil  Makers  47.1 

Riggers  47.1 

Sheet  Metal  Workers  47.1 

Shipping  Clerks  47.0 

Wire  Drawers  46.9 

Other  Ranks  -  Armed  Forces  46. £ 

Electroplaters  46. B 

Plumbers  46.  S 

Motormen  46.7 

Quarriers  46.6 

Machine  Operators  -  Metal  46.5 

Paint  Makers  46.4 

Filers  46.4 

Upholsterers  46.3 

Knitters  46.3 

Wood  Inspectors  46.3 

Barbers  (F)  46.2 

Milliners  (F)  46.2 

Tobacco  Products  Workers  46.2 

Furnacemen  46.2 

Furriers  46.2 

Brothers  46.2 

Paper  Box  Makers  46. 1 

Other  Bookbinding  Workers  (F)  46. 0 

Coil  Makers  46. 0 

Vulcanizers  46. 0 
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APPENDIX  B  (continued) 


Occupation 

Scaled  Score 

Liquor  and  Beverage  Workers 

46.0 

Postmen 

45.9 

Meat  Canners 

45.9 

Other  Upholstery  Workers  (F) 

45.8 

Bookbinders  (F) 

45.8 

Transport,  storage,  communication 

Workers 

(F)  45.8 

Polishers  -  Metal 

45.8 

Furriers  (F) 

45.6 

Structural  Iron  Workers 

45.6 

Mechanics  -  Motor 

45.6 

Textile  Inspectors 

45.6 

Cabinet  and  Furniture  Makers 

45.5 

Loom  Fixers 

45.5 

Weavers  -  Textile 

45.4 

Butchers 

45.4 

Miners 

45.4 

Logging  Foremen 

45.4 

Labellers 

45.3 

Nurses  in  Training  (F) 

45.2 

Meat  Canners 

45.2 

Farm  Managers 

45.2 

Plasterers 

45.2 

Textile  Inspectors 

45.1 

Other  Pulp  and  Paper  Workers  (F) 

45.1 

Winders  and  Warpers 

45.0 

Carders  and  Drawing  Frame  Workers 

(F) 

45.0 

Sales  Clerks  (F) 

45.0 

Moulders  -  Metal 

45.0 

Nurses  -  Practical 

45.0 
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APPENDIX  B  (continued) 


Occupation 

Scaled  Score 

Cutters  -  Textile  Goods  (F) 

44.9 

Elevator  Tenders  (F) 

44*  $ 

Tailoresses  (F) 

44  •  S 

Textile  -  Inspectors  (F) 

44*$ 

Potmen 

44*  $ 

Timbermen 

44.7 

Prospectors 

44.7 

Oilers  -  Power  Plant 

44.7 

Liquor  and  Beverage  Workers  ( F ) 

44*  6 

Paper  Box  Makers  (F) 

44  •  6 

Kiln  Burners 

44  *  6 

Brick  and  Stone  Masons 

44.6 

Construction  Machine  Operators 

44.5 

Canvassers  (F) 

44  •  4 

Service  Station  Attendants 

44  *  4 

Painters  and  Decorators 

44. 4 

Hat  and  Cap  Makers 

44*4 

Bleachers  and  Dyers 

Spinners  and  Twisters  (F) 

44.3 

Rubber  Shoe  Makers  (F) 

44. 3 

Porters 

44*  2 

Tobacco  Products  Workers 

44 . 2 

Millers 

44.2 

Nurses  Practical  (F) 

44.1 

Finishers  -  Textile  (F) 

44  *  0 

Blacksmiths 

44*  0 

Tailors 

44*  0 

Bakers 

43.8 

Weavers 

43.8 

Rubber  Shoe  Makers 

43.8 
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APPENDIX  B  (continued) 


Occupation 

Scaled  Score 

Labellers  (F) 

43.7 

Other  Personal  Service  Workers 

(F) 

43.6 

Barbers 

43.6 

Truck  Drivers 

43.6 

Packers  and  Wrappers 

43.6 

Finishers  -  Wood 

43.6 

Finishers  -  Textile 

43.6 

Tanners 

43.6 

Hat  and  Cap  Makers 

43.3 

Cutters  -  Leather 

43.3 

Commercial  Packers  and  Wrappers 

(F) 

43.4 

Teamsters,  Carriage  Drivers 

43  •  4 

Stone  Cutters 

43.4 

Riveters 

43  •  4 

Butter  and  Cheese  Makers 

43.3 

Chauffeurs 

43.3 

Boiler  Firemen 

43.3 

Spinners 

43.3 

Inspectors  -  N.E.S.  (F) 

43.2 

Postmen  (F) 

43.2 

Waiters 

43.2 

Carpenters 

43.2 

Serving  Machine  Operators 

43.2 

Forest  Rangers 

43.2 

Lock  Keepers  -  Canal  Men 

43.1 

Wood  Turners 

43.1 

Laborers  -  Mines  and  Quarries 

43.1 

Sewing  Machine  Operators  (F) 

43.0 

Brick  and  Stone  Masons  (F) 

43.0 
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APPENDIX  B  ( continued ) 


Occupation 

Scaled  Score 

Textile  Inspectors  (F) 

42.  3 

Machine  Operators  -  Boot  and 

Shoe  (F) 

42. B 

Knitters  (F) 

42.3 

Guards 

42.3 

Winders,  Warpers,  Reelers 

42.3 

Glove  Makers 

42.7 

Cutters  -  Leather  (F) 

42.6 

Elevator  Attendants 

42.5 

Bakers  (F) 

42.4 

Machine  Operators  -  Boot  and 

Shoe 

42  •  4 

Launderers 

42.4 

Firemen  on  Ships 

42 . 4 

Cementers 

42  •  4 

Dressmakers  (F) 

42.3 

Carders  and  Drawing  Frame  Tenders 

42.3 

Box  and  Basket  Makers  (F) 

42.2 

Coopers 

42.2 

Sailers 

42.1 

Harness  and  Saddle  Makers 

42.0 

Nuns 

41  •  3 

Cooks 

41.3 

Janitors 

41.6 

Laundresses  -  Dyers  (F) 

41.4 

Sectionmen  -  Trackmen 

41.4 

Charworkers  -  Cleaners 

41.3 

Paper  Box  -  Envelope  Makers 

41.3 

Sawyers 

41.2 

Longshoremen 

41.2 

Waitresses  (F) 

41.2 

... 
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APPENDIX  B  (continued) 


Occupation 

Scaled  Score 

Glovemakers  (F) 

41.2 

Laborers 

40.  8 

Cooks  (F) 

40.5 

Messengers 

40.2 

Shoemakers 

40.2 

Ushers 

40.1 

Janitors  (F) 

40.0 

Hawkers 

39.3 

Housekeepers,  Matrons  (F) 

38.9 

Hotel,  Cafe,  and  Household  Workers 

33.8 

Newsboys 

38.7 

Guides 

37.8 

Hotel,  Cafe,  and  Household  Workers  (F) 

37.5 

Farm  Laborers 

37.5 

Lumbermen 

37.4 

Charworkers,  Cleaners  (F) 

37.4 

Fishermen 

36.9 

Bootblacks 

36.8 

Fish  Canners 

36.2 

Fish  Canners  (F) 

36.0 

Hunters  and  Trappers 

32.0 
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